Wang, Jinbo

3 publications

ICLR 2026 Fast Catch-up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws Jinbo Wang, Binghui Li, Zhanpeng Zhou, Mingze Wang, Yuxuan Sun, Jiaqi Zhang, Xunliang Cai, Lei Wu
ICML 2025 The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training Jinbo Wang, Mingze Wang, Zhanpeng Zhou, Junchi Yan, Weinan E, Lei Wu
NeurIPS 2024 Improving Generalization and Convergence by Enhancing Implicit Regularization Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu