II.
EvalRun overview
Reference · liveeval-run:gpqa.deepseek-r1.2025-01
eval-run:gpqa.deepseek-r1.2025-01 overview
Inspect the raw attributes, linked wiki pages, and inbound or outbound graph edges for eval-run:gpqa.deepseek-r1.2025-01.
Attributes
target
benchmarkId
testSetId
targetId
runAt
2025-01-20T00:00:00Z
runBy
deepseek
configHash
Outgoing edges
evaluates_target1
- model:deepseek-r1@current·ModelVersionDeepSeek R1
for_benchmark1
- benchmark:gpqa·BenchmarkGPQA
uses_test_set1
- test-set:gpqa-diamond-2024·TestSetGPQA Diamond — 2024 release
Incoming edges
belongs_to_eval_run1
- eval-result:gpqa.deepseek-r1.001·EvalResult