Sun, Weigao

9 publications

NeurIPS 2025 Improving Bilinear RNN with Closed-Loop Control Jiaxi Hu, Yongqi Pan, Jusen Du, Disen Lan, Xiaqiang Tang, Qingsong Wen, Yuxuan Liang, Weigao Sun
TMLR 2025 LASP: Linear Attention Sequence Parallelism Weigao Sun, Zhen Qin, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong
ICML 2025 Liger: Linearizing Large Language Models to Gated Recurrent Structures Disen Lan, Weigao Sun, Jiaxi Hu, Jusen Du, Yu Cheng
ICLRW 2025 Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts Weigao Sun, Disen Lan, Tong Zhu, Xiaoye Qu, Yu Cheng
AAAI 2025 Sequence Accumulation and Beyond: Infinite Context Length on Single GPU and Large Clusters Weigao Sun, Yongtuo Liu, Xiaqiang Tang, Xiaoyu Mo
ICLR 2024 CO2: Efficient Distributed Training with Full Communication-Computation Overlap Weigao Sun, Zhen Qin, Weixuan Sun, Shidi Li, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong
NeurIPSW 2024 Linear Attention Sequence Parallelism Weigao Sun, Zhen Qin, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong
ICML 2024 Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention Zhen Qin, Weigao Sun, Dong Li, Xuyang Shen, Weixuan Sun, Yiran Zhong
IJCAI 2020 pbSGD: Powered Stochastic Gradient Descent Methods for Accelerated Non-Convex Optimization Beitong Zhou, Jun Liu, Weigao Sun, Ruijuan Chen, Claire J. Tomlin, Ye Yuan