Atlas Graph Explorer
Wiki
Graph
Edges
Home
Benchmark
benchmark:human-eval
HumanEval
benchmark:human-eval
Benchmark
benchmarks/benchmarks/human-eval.yaml
·
Open in Graph →
overview
json
graph
Attributes
displayName
HumanEval
homepageUrl
https://github.com/openai/human-eval
kind
function-completion
targetsKind
ModelVersion
description
Hand-written programming problems for evaluating code generation.
Outgoing edges
(1)
covers
1
skill-area:python-implementation
·
SkillArea
Python Function Implementation
Incoming edges
(11)
bounds_subject
1
scope-boundary:human-eval.scope
·
ScopeBoundary
for_benchmark
10
eval-run:human-eval.qwen-2-5-72b.2024-09
·
EvalRun
eval-run:human-eval.qwen-2-5-coder-32b.2024-11
·
EvalRun
eval-run:human-eval.claude-sonnet-4-6.2025-11
·
EvalRun
eval-run:human-eval.deepseek-v3.2024-12
·
EvalRun
eval-run:human-eval.llama-4-405b.2024-07
·
EvalRun
eval-run:human-eval.llama-3-1-405b.2024-07
·
EvalRun
eval-run:human-eval.llama-3-3-70b.2024-12
·
EvalRun
eval-run:human-eval.mistral-large-2.2024-07
·
EvalRun
eval-run:human-eval.codestral-25-01.2025-01
·
EvalRun
eval-run:human-eval.gpt-5.2025-08
·
EvalRun