Scale AI
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
datasets 25
ScaleAI/SWE-Atlas-QnA
Viewer • Updated • 124 • 370 • 14
ScaleAI/RaR-Medicine
Viewer • Updated • 22.4k • 31 • 1
ScaleAI/RaR-Science
Viewer • Updated • 22.9k • 33 • 1
ScaleAI/SWE-bench_Pro
Benchmark • Updated • 731 • 515k • 56
ScaleAI/mrt
Updated • 14.2k • 4
ScaleAI/audiomc
Viewer • Updated • 452 • 996 • 13
ScaleAI/lhaw
Viewer • Updated • 285 • 183 • 4
ScaleAI/SciPredict
Viewer • Updated • 405 • 20 • 2
ScaleAI/PRBench
Viewer • Updated • 1.65k • 456 • 6
ScaleAI/MCP-Atlas
Viewer • Updated • 500 • 2.06k • 11