Dai, Silong

1 publications

ICLR 2026 Group Verification-Based Policy Optimization for Interactive Coding Agents Silong Dai, Changzhi Sun, Haolun Wu, Huanran Zheng, Tao Ji, Junchi Yan, Yuanbin Wu, Dell Zhang, Xiaoling Wang, Xuelong Li