Tong, Yuxuan

5 publications

NeurIPS 2025 DAPO: An Open-Source LLM Reinforcement Learning System at Scale Qiying Yu, Zheng Zhang, Ruofei Zhu, Yufeng Yuan, Xiaochen Zuo, YuYue, Weinan Dai, Tiantian Fan, Gaohong Liu, Juncai Liu, LingJun Liu, Xin Liu, Haibin Lin, Zhiqi Lin, Bole Ma, Guangming Sheng, Yuxuan Tong, Chi Zhang, Mofan Zhang, Ru Zhang, Wang Zhang, Hang Zhu, Jinhua Zhu, Jiaze Chen, Jiangjie Chen, Chengyi Wang, Hongli Yu, Yuxuan Song, Xiangpeng Wei, Hao Zhou, Jingjing Liu, Wei-Ying Ma, Ya-Qin Zhang, Lin Yan, Yonghui Wu, Mingxuan Wang
ICML 2025 Demystifying Long Chain-of-Thought Reasoning Shiming Yang, Yuxuan Tong, Xinyao Niu, Graham Neubig, Xiang Yue
ICLRW 2025 Demystifying Long Chain-of-Thought Reasoning in LLMs Edward Yeo, Yuxuan Tong, Xinyao Niu, Graham Neubig, Xiang Yue
NeurIPS 2024 DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving Yuxuan Tong, Xiwen Zhang, Rui Wang, Ruidong Wu, Junxian He
NeurIPS 2023 ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation Jiazheng Xu, Xiao Liu, Yuchen Wu, Yuxuan Tong, Qinkai Li, Ming Ding, Jie Tang, Yuxiao Dong