Xu, Shusheng

8 publications

NeurIPS 2025 AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning Wei Fu, Jiaxuan Gao, Xujie Shen, Chen Zhu, Zhiyu Mei, Chuyi He, Shusheng Xu, Guo Wei, Jun Mei, Wang Jiashu, Tongkai Yang, Binhang Yuan, Yi Wu
NeurIPS 2025 How Far Are We from Optimal Reasoning Efficiency? Jiaxuan Gao, Shu Yan, Qixin Tan, Lu Yang, Shusheng Xu, Wei Fu, Zhiyu Mei, Kaifeng Lyu, Yi Wu
NeurIPS 2025 Reasoning Is Not a Race: When Stopping Early Beats Going Deeper Mohan Zhang, Jiaxuan Gao, Shusheng Xu, Yi Wu
ICML 2024 Is DPO Superior to PPO for LLM Alignment? a Comprehensive Study Shusheng Xu, Wei Fu, Jiaxuan Gao, Wenjie Ye, Weilin Liu, Zhiyu Mei, Guangju Wang, Chao Yu, Yi Wu
TMLR 2023 Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu
ICLRW 2023 PhyloTransformer: A Self-Supervised Discriminative Model for SARS-CoV-2 Viral Mutation Prediction Based on a Multi-Head Self-Attention Mechanism Yingying Wu, Shusheng Xu, Shing-Tung Yau, Yi Wu
NeurIPS 2022 Grounded Reinforcement Learning: Learning to Win the Game Under Human Commands Shusheng Xu, Huaijie Wang, Yi Wu
AAAI 2022 Sequence Level Contrastive Learning for Text Summarization Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei