II.
EvalHarness JSON
Structured · liveeval-harness:helm
Stanford HELM json
Inspect the normalized record payload exactly as the atlas UI reads it.
{
"id": "eval-harness:helm",
"_kind": "EvalHarness",
"_file": "benchmarks/eval-harnesses/eval-harnesses.yaml",
"_cluster": "benchmarks",
"attributes": {
"displayName": "Stanford HELM",
"harnessKind": "helm",
"homepageUrl": "https://github.com/stanford-crfm/helm",
"description": "Holistic Evaluation of Language Models; broad multi-metric framework.\n"
},
"outgoingEdges": [],
"incomingEdges": [
{
"from": "eval-run:gaia.claude-code.2025",
"to": "eval-harness:helm",
"kind": "uses_harness",
"attributes": {}
}
]
}