Ye, Shengyu

3 publications

ICLR 2026 Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Xumeng Wen, Zihan Liu, Shun Zheng, Shengyu Ye, Zhirong Wu, Yang Wang, Zhijian Xu, Xiao Liang, Junjie Li, Ziming Miao, Jiang Bian, Mao Yang
ICML 2025 CursorCore: Assist Programming Through Aligning Anything Hao Jiang, Qi Liu, Rui Li, Shengyu Ye, Shijin Wang
AAAI 2025 VERSE: Verification-Based Self-Play for Code Instructions Hao Jiang, Qi Liu, Rui Li, Yuze Zhao, Yixiao Ma, Shengyu Ye, Junyu Lu, Yu Su