Atlas Graph Explorer
Wiki
Graph
Edges
Home
EvalRun
eval-run:truthful-qa.claude-opus-4-5.2025-09
eval-run:truthful-qa.claude-opus-4-5.2025-09
eval-run:truthful-qa.claude-opus-4-5.2025-09
EvalRun
benchmarks/eval-runs/eval-runs-anthropic.yaml
·
Open in Graph →
overview
json
graph
Attributes
target
model:claude-opus-4-5@current
benchmarkId
benchmark:truthful-qa
testSetId
test-set:truthful-qa-mc
targetId
model:claude-opus-4-5@current
runAt
2025-09-29T00:00:00Z
runBy
anthropic
configHash
sha256:placeholder-claude-opus-4-5-truthful-qa
Outgoing edges
(3)
evaluates_target
1
model:claude-opus-4-5@current
·
ModelVersion
for_benchmark
1
benchmark:truthful-qa
·
Benchmark
TruthfulQA
uses_test_set
1
test-set:truthful-qa-mc
·
TestSet
TruthfulQA — multiple-choice
Incoming edges
(1)
belongs_to_eval_run
1
eval-result:truthful-qa.claude-opus-4-5.001
·
EvalResult