Cheng, Shenggan

6 publications

ICML 2025 DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers Xuanlei Zhao, Shenggan Cheng, Chang Chen, Zangwei Zheng, Ziming Liu, Zheming Yang, Yang You
NeurIPS 2025 ElasticMM: Efficient Multimodal LLMs Serving with Elastic Multimodal Parallelism Zedong Liu, Shenggan Cheng, Guangming Tan, Yang You, Dingwen Tao
ICML 2025 SeedLoRA: A Fusion Approach to Efficient LLM Fine-Tuning Yong Liu, Di Fu, Shenggan Cheng, Zirui Zhu, Yang Luo, Minhao Cheng, Cho-Jui Hsieh, Yang You
NeurIPS 2025 StarTrail: Concentric Ring Sequence Parallelism for Efficient Near-Infinite-Context Transformer Model Training Ziming Liu, Shaoyu Wang, Shenggan Cheng, Zhongkai Zhao, Kai Wang, Xuanlei Zhao, James Demmel, Yang You
ICLR 2024 AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Haotian Zhou, Bin Jia, Yang You
ECCV 2020 FTL: A Universal Framework for Training Low-Bit DNNs via Feature Transfer Kunyuan Du, Ya Zhang, Haibing Guan, Qi Tian, Shenggan Cheng, James Lin