Wang, Zecheng

3 publications

ICML 2025 Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization Deyuan Liu, Zecheng Wang, Bingning Wang, Weipeng Chen, Chunshan Li, Zhiying Tu, Dianhui Chu, Dianbo Sui
NeurIPS 2025 VPO: Reasoning Preferences Optimization Based on $\mathcal{V}$-Usable Information Zecheng Wang, Chunshan Li, Yupeng Zhang, Han Liu, Bingning Wang, Dianhui Chu, Dianbo Sui
ICLR 2024 Pre-Training with Synthetic Data Helps Offline Reinforcement Learning Zecheng Wang, Che Wang, Zixuan Dong, Keith W. Ross