Li, Chunshan

2 publications

ICML 2025. Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization. Deyuan Liu, Zecheng Wang, Bingning Wang, Weipeng Chen, Chunshan Li, Zhiying Tu, Dianhui Chu, Dianbo Sui.
NeurIPS 2025. VPO: Reasoning Preferences Optimization Based on $\mathcal{V}$-Usable Information. Zecheng Wang, Chunshan Li, Yupeng Zhang, Han Liu, Bingning Wang, Dianhui Chu, Dianbo Sui.