Wang, Jinbo

2 publications

ICML 2025 The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training Jinbo Wang, Mingze Wang, Zhanpeng Zhou, Junchi Yan, Weinan E, Lei Wu
NeurIPS 2024 Improving Generalization and Convergence by Enhancing Implicit Regularization Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu