Atlas Graph Explorer
eval-run:gpqa-diamond.gpt-5.2025-08
EvalRun
benchmarks/eval-runs/eval-runs-openai.yaml
Attributes
  target       model:gpt-5@current
  benchmarkId  benchmark:gpqa
  testSetId    test-set:gpqa-diamond-2024
  targetId     model:gpt-5@current
  runAt        2025-08-07T00:00:00Z
  runBy        openai
  configHash   sha256:placeholder-gpt-5-gpqa-diamond
Outgoing edges (3)
  evaluates_target (1)  model:gpt-5@current · ModelVersion
  for_benchmark (1)     benchmark:gpqa · Benchmark · GPQA
  uses_test_set (1)     test-set:gpqa-diamond-2024 · TestSet · GPQA Diamond — 2024 release
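The three outgoing edges listed above appear to mirror the targetId, benchmarkId, and testSetId attributes. A minimal Python sketch of that relationship follows. The `EvalRun` class, its field names, and the `outgoing_edges` method are hypothetical illustrations, not part of the Atlas schema, and the attribute-to-edge mapping is an assumption read off this record alone.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class EvalRun:
    """Hypothetical in-memory form of the record shown on this page."""
    id: str
    target_id: str
    benchmark_id: str
    test_set_id: str
    run_at: datetime
    run_by: str
    config_hash: str

    def outgoing_edges(self) -> list[tuple[str, str]]:
        # Assumed mapping: each *Id attribute yields one outgoing edge.
        return [
            ("evaluates_target", self.target_id),
            ("for_benchmark", self.benchmark_id),
            ("uses_test_set", self.test_set_id),
        ]


run = EvalRun(
    id="eval-run:gpqa-diamond.gpt-5.2025-08",
    target_id="model:gpt-5@current",
    benchmark_id="benchmark:gpqa",
    test_set_id="test-set:gpqa-diamond-2024",
    # "+00:00" is used instead of "Z" so parsing also works before Python 3.11.
    run_at=datetime.fromisoformat("2025-08-07T00:00:00+00:00"),
    run_by="openai",
    config_hash="sha256:placeholder-gpt-5-gpqa-diamond",
)

for edge_type, node_id in run.outgoing_edges():
    print(edge_type, "->", node_id)
```

If the mapping holds, the length of `outgoing_edges()` matches the "(3)" count in the heading above.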
Incoming edges (1)
  belongs_to_eval_run (1)  eval-result:gpqa-diamond.gpt-5.001 · EvalResult