Atlas Graph Explorer
Wiki
Graph
Edges
Home
EdgeKinds
judged_by
judged_by
an EvalRun is judged by a Judge
3 wired pairs ยท cardinality N:N
from
to
to kind
eval-run:gaia.claude-code.2025
judge:gpt-4o-pairwise
Judge
eval-run:gaia.claude-code.2025
judge:claude-3-5-sonnet-rubric
Judge
eval-run:gaia.claude-code.2025
judge:exact-match
Judge