Cai, Matthew

1 publications

ICLR 2026 CyberGym: Evaluating AI Agents' Real-World Cybersecurity Capabilities at Scale Zhun Wang, Tianneng Shi, Jingxuan He, Matthew Cai, Jialin Zhang, Dawn Song