Tian, Changxin

2 publications

ICLR 2026 Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models Changxin Tian, Kunlong Chen, Jia Liu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou
ICLR 2026 WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-Training Changxin Tian, Jiapeng Wang, Qian Zhao, Kunlong Chen, Jia Liu, Ziqi Liu, Jiaxin Mao, Xin Zhao, Zhiqiang Zhang, Jun Zhou