Tang, Yuheng

4 publications

ICLR 2026 DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle Yuheng Tang, Kaijie Zhu, Bonan Ruan, Chuqi Zhang, Michael Yang, Hongwei Li, Suyue Guo, Tianneng Shi, Zekun Li, Christopher Kruegel, Giovanni Vigna, Dawn Song, William Yang Wang, Lun Wang, Yangruibo Ding, Zhenkai Liang, Wenbo Guo
NeurIPS 2025 Co-PatcheR: Collaborative Software Patching with Component-Specific Small Reasoning Models Yuheng Tang, Hongwei Li, Kaijie Zhu, Michael Yang, Yangruibo Ding, Wenbo Guo
ICML 2025 PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification Hongwei Li, Yuheng Tang, Shiqi Wang, Wenbo Guo
NeurIPS 2025 SECODEPLT: A Unified Benchmark for Evaluating the Security Risks and Capabilities of Code GenAI Yuzhou Nie, Zhun Wang, Yu Yang, Ruizhe Jiang, Yuheng Tang, Xander Davies, Yarin Gal, Bo Li, Wenbo Guo, Dawn Song