Zhang, Stephen

5 publications

NeurIPS 2025 Attention Sinks: A 'Catch, Tag, Release' Mechanism for Embeddings Stephen Zhang, Mustafa Khan, Vardan Papyan
ICLRW 2025 Low-Rank Is Required for Pruning LLMs Stephen Zhang, Vardan Papyan
ICLR 2025 OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition Stephen Zhang, Vardan Papyan
ICML 2024 Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities Stephen Zhang, Vardan Papyan
NeurIPS 2022 Trajectory Inference via Mean-Field Langevin in Path Space Lénaïc Chizat, Stephen Zhang, Matthieu Heitz, Geoffrey Schiebinger