Agentic AI Atlas

II.

Benchmark overview

benchmark:gaia

Reference · live

GAIA overview

General AI Assistants benchmark — real-world agent reasoning tasks.

BenchmarkOutgoing · 1Incoming · 5

displayName

GAIA

homepageUrl

kind

agent-reasoning

targetsKind

AgentVersion

description

General AI Assistants benchmark — real-world agent reasoning tasks.

covers1

bounds_subject1

evaluated_by1

for_benchmark1

scored_against1

split_of1