Atlas Graph Explorer
GPT-4o pairwise preference judge
judge:gpt-4o-pairwise
Judge
benchmarks/eval-harnesses/judges.yaml
Attributes
displayName: GPT-4o pairwise preference judge
judgeKind: llm
rubricId: rubric:helpfulness-1-5
notes: Standard pairwise A/B preference judge using GPT-4o; emits a winner + rationale.
Outgoing edges (0)
None.
Incoming edges (1)
judged_by (1): eval-run:gaia.claude-code.2025 (EvalRun)