ML Anthology
Sun, Weigao
9 publications
NeurIPS
2025
Improving Bilinear RNN with Closed-Loop Control
Jiaxi Hu, Yongqi Pan, Jusen Du, Disen Lan, Xiaqiang Tang, Qingsong Wen, Yuxuan Liang, Weigao Sun
TMLR
2025
LASP: Linear Attention Sequence Parallelism
Weigao Sun, Zhen Qin, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong
ICML
2025
Liger: Linearizing Large Language Models to Gated Recurrent Structures
Disen Lan, Weigao Sun, Jiaxi Hu, Jusen Du, Yu Cheng
ICLRW
2025
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
Weigao Sun, Disen Lan, Tong Zhu, Xiaoye Qu, Yu Cheng
AAAI
2025
Sequence Accumulation and Beyond: Infinite Context Length on Single GPU and Large Clusters
Weigao Sun, Yongtuo Liu, Xiaqiang Tang, Xiaoyu Mo
ICLR
2024
CO2: Efficient Distributed Training with Full Communication-Computation Overlap
Weigao Sun, Zhen Qin, Weixuan Sun, Shidi Li, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong
NeurIPSW
2024
Linear Attention Sequence Parallelism
Weigao Sun, Zhen Qin, Dong Li, Xuyang Shen, Yu Qiao, Yiran Zhong
ICML
2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
Zhen Qin, Weigao Sun, Dong Li, Xuyang Shen, Weixuan Sun, Yiran Zhong
IJCAI
2020
pbSGD: Powered Stochastic Gradient Descent Methods for Accelerated Non-Convex Optimization
Beitong Zhou, Jun Liu, Weigao Sun, Ruijuan Chen, Claire J. Tomlin, Ye Yuan