Atlas Graph Explorer
Wiki
Graph
Edges
Home
EvalResult
eval-result:mmlu.qwen-2-5-72b.001
eval-result:mmlu.qwen-2-5-72b.001
eval-result:mmlu.qwen-2-5-72b.001
EvalResult
benchmarks/eval-results/eval-results-alibaba-qwen.yaml
·
Open in Graph →
overview
json
graph
Attributes
evalRunId
eval-run:mmlu.qwen-2-5-72b.2024-09
metricName
accuracy
score
0.861
unit
fraction
passFail
pass
reportedAt
2024-09-19T00:00:00Z
Outgoing edges
(2)
belongs_to_eval_run
1
eval-run:mmlu.qwen-2-5-72b.2024-09
·
EvalRun
scored_against
1
benchmark:mmlu
·
Benchmark
MMLU
Incoming edges
(1)
produced_result
1
eval-run:gaia.claude-code.2025
·
EvalRun