Li, Shenggui

3 publications

ICLR 2026 DSA: Efficient Inference for Video Generation Models via Distributed Sparse Attention Shenggui Li, Runyu Lu, Qiaoling Chen, Haiyan Yin, Yueming Lyu, Yonggang Wen, Ivor Tsang, Tianwei Zhang
ICML 2024 GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding Cunxiao Du, Jing Jiang, Xu Yuanchen, Jiawei Wu, Sicheng Yu, Yongqi Li, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu, Yang You
ICMLW 2023 Sequence Parallelism: Long Sequence Training from System Perspective Shenggui Li, Fuzhao Xue, Chaitanya Baranwal, Yongbin Li, Yang You