Atlas Graph Explorer
Wiki
Graph
Edges
Home
EvalResult
eval-result:gpqa-diamond.gpt-5-4.2026-03-17.accuracy
eval-result:gpqa-diamond.gpt-5-4.2026-03-17.accuracy
eval-result:gpqa-diamond.gpt-5-4.2026-03-17.accuracy
EvalResult
benchmarks/eval-results/eval-results-openai.yaml
·
Open in Graph →
overview
json
graph
Attributes
evalRunId
eval-run:gpqa-diamond.gpt-5-4.2026-03-17
metricName
accuracy
score
0.93
unit
fraction
passFail
pass
reportedAt
2026-03-17T00:00:00Z
Outgoing edges
(1)
belongs_to_eval_run
1
eval-run:gpqa-diamond.gpt-5-4.2026-03-17
·
EvalRun
Incoming edges
(0)
None.