Record
Agentic AI Atlas · TheAgentCompany
benchmark:the-agent-company · a5c.ai
II. Benchmark overview

Reference · live

TheAgentCompany overview

CMU benchmark simulating a real software-company environment (Gitea, RocketChat, Plane, OwnCloud, etc.) where agents complete consequential workplace tasks across tools.

Benchmark · Outgoing: 4 · Incoming: 0

Attributes

displayName: TheAgentCompany
homepageUrl:
kind: full-stack
targetsKind: AgentVersion
description: CMU benchmark simulating a real software-company environment (Gitea, RocketChat, Plane, OwnCloud, etc.) where agents complete consequential workplace tasks across tools.

Outgoing edges

applies_to: 2
covers: 2

Incoming edges

None.