DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_0a0458a3 Viewer • Updated about 5 hours ago • 764
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_c267e2e6 Viewer • Updated about 9 hours ago • 509
DCAgent/eval-terminal-bench-2.0-claude-haiku-4-5-20251001-20260115_165217 Viewer • Updated about 10 hours ago • 272
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_a60f4588 Viewer • Updated about 11 hours ago • 371
DCAgent/eval-terminal-bench-2.0-gpt-5-mini-2025-08-07-20260115_093339 Viewer • Updated about 20 hours ago • 269 • 7
DCAgent/eval-terminal-bench-2.0-gemini-2.5-flash-20260114_222605 Viewer • Updated 1 day ago • 312 • 6
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-nano-2025-08-07-20260114_142654 Viewer • Updated 1 day ago • 293 • 2
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-mini-2025-08-07-20260114_222454 Viewer • Updated 1 day ago • 300 • 4
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gemini-2.5-flash-20260114_200318 Viewer • Updated 1 day ago • 339 • 4