Ma, Yiyuan

3 publications

ICLR 2026
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
Ang Lv, Jin Ma, Yiyuan Ma, Siyuan Qiao

ICLR 2025
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
Xunhao Lai, Jianqiao Lu, Yao Luo, Yiyuan Ma, Xun Zhou

NeurIPS 2025
Model Merging in Pre-Training of Large Language Models
Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, Jianqiao Lu, Ziwen Xu, Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Zhou Xun, Liang Xiang, Yonghui Wu