Scale AI
company
Verified
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
Datasets (21)
ScaleAI/lhaw • 285 • 8
ScaleAI/SWE-bench_Pro • 731 • 23.1k • 49
ScaleAI/audiomc • 452 • 828 • 6
ScaleAI/SciPredict • 405 • 103 • 1
ScaleAI/PRBench • 1.65k • 604 • 6
ScaleAI/MCP-Atlas • 500 • 627 • 7
ScaleAI/VisualToolBench • 1.2k • 84 • 3
ScaleAI/dummy_mcp • 16 • 20
ScaleAI/researchrubrics • 101 • 174 • 17
ScaleAI/swe-oec-claude-expert • 1.27k • 63 • 1