displayName
Cybench
homepageUrl
https://cybench.github.io/
kind
security
targetsKind
AgentVersion
description
Cybench (Stanford et al.) measures the capability of language models
and agents on real-world cybersecurity capture-the-flag (CTF) tasks
spanning cryptography, web exploitation, reverse engineering, binary
exploitation (pwn), and forensics.