Li, Steven

1 publications

ICLR 2025 R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference Zhenyu Zhang, Zechun Liu, Yuandong Tian, Harshit Khaitan, Zhangyang Wang, Steven Li