subjectId
inScope
EvalPlus-augmented MBPP — adds large numbers of additional auto-generated tests to the original Mostly Basic Python Problems suite to detect overfitting.
outOfScope
Repository-scale tasks, advanced or competition-level problems (use APPS / LiveCodeBench), and non-Python languages.
outOfScopeReasonIds