displayName
AppWorld
homepageUrl
https://appworld.dev/
kind
tool-use
targetsKind
AgentVersion
description
Tool-use benchmark with a controllable ecosystem of 9 simulated
apps (email, calendar, shopping, etc.) and 750 cross-app tasks.
Tests long-horizon multi-app orchestration.