Cheng, Jialiang

2 publications

ICLR 2026 SERE: Similarity-Based Expert Re-Routing for Efficient Batch Decoding in MoE Models Juntong Wu, Jialiang Cheng, Fuyu Lv, Ou Dan, Li Yuan
ICLR 2025 EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models Jialiang Cheng, Ning Gao, Yun Yue, Zhiling Ye, Jiadi Jiang, Jian Sha