ML Anthology
Authors
Search
About
Tian, Changxin
2 publications
ICLR
2026
Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models
Changxin Tian
,
Kunlong Chen
,
Jia Liu
,
Ziqi Liu
,
Zhiqiang Zhang
,
Jun Zhou
ICLR
2026
WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-Training
Changxin Tian
,
Jiapeng Wang
,
Qian Zhao
,
Kunlong Chen
,
Jia Liu
,
Ziqi Liu
,
Jiaxin Mao
,
Xin Zhao
,
Zhiqiang Zhang
,
Jun Zhou