Chen, Mingyu

6 publications

NeurIPS 2025 Accelerating RL for LLM Reasoning with Optimal Advantage Regression Kianté Brantley, Mingyu Chen, Zhaolin Gao, Jason D. Lee, Wen Sun, Wenhao Zhan, Xuezhou Zhang
NeurIPS 2025 Avoiding exp(R) Scaling in RLHF Through Preference-Based Exploration Mingyu Chen, Yiding Chen, Wen Sun, Xuezhou Zhang
ICMLW 2024 Improved Algorithms for Adversarial Bandits with Unbounded Losses Mingyu Chen, Xuezhou Zhang
COLT 2024 Scale-Free Adversarial Reinforcement Learning Mingyu Chen, Xuezhou Zhang
NeurIPS 2024 State-Free Reinforcement Learning Mingyu Chen, Aldo Pacchiano, Xuezhou Zhang
ICMLW 2023 Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models’ Reasoning Performance Yao Fu, Litu Ou, Yuhao Wan, Mingyu Chen, Hao Peng, Tushar Khot