displayName
OSWorld
homepageUrl
https://os-world.github.io/
kind
full-stack
targetsKind
AgentVersion
description
OSWorld (Xie et al., 2024) is a scalable real-computer environment
benchmark for multimodal agents performing open-ended tasks on
Ubuntu, Windows, and macOS with full GUI, keyboard, and mouse access.
It comprises 369 real-world tasks across browser, OS, and productivity apps.