Atlas Graph Explorer
Wiki
Graph
Edges
Home
EvalRun
eval-run:math.o3.2025-04
eval-run:math.o3.2025-04
eval-run:math.o3.2025-04
EvalRun
benchmarks/eval-runs/eval-runs-openai.yaml
·
Open in Graph →
overview
json
graph
Attributes
target
model:o3@current
benchmarkId
benchmark:math
testSetId
test-set:swe-bench-verified-2024-12
targetId
model:o3@current
runAt
2025-04-16T00:00:00Z
runBy
openai
configHash
sha256:placeholder-o3-math
Outgoing edges
(2)
evaluates_target
1
model:o3@current
·
ModelVersion
for_benchmark
1
benchmark:math
·
Benchmark
MATH
Incoming edges
(1)
belongs_to_eval_run
1
eval-result:math.o3.001
·
EvalResult