Atlas Graph Explorer
Wiki
Graph
Edges
Home
EvalHarness
eval-harness:inspect-ai
Inspect AI
eval-harness:inspect-ai
EvalHarness
benchmarks/eval-harnesses/eval-harnesses.yaml
·
Open in Graph →
overview
json
graph
Attributes
displayName
Inspect AI
harnessKind
inspect-ai
homepageUrl
https://github.com/UKGovernmentBEIS/inspect_ai
description
UK AISI's evaluation framework for LLM agent and capability evals. Solver/scorer abstraction; first-class log and replay format.
Outgoing edges
(0)
None.
Incoming edges
(1)
uses_harness
1
eval-run:gaia.claude-code.2025
·
EvalRun