subjectId
inScope
Grade-school math word problems (~8.5k) that require multi-step arithmetic reasoning expressed in natural language. Scored by exact match on the final numeric answer.
outOfScope
Code-generation tasks, agentic tool use, advanced or competition-level mathematics (use MATH instead), and multilingual variants.
outOfScopeReasonIds