Chen, Zhuoming

13 publications

ICML 2025 GSM-$∞$: How Do Your LLMs Behave over Infinitely Increasing Reasoning Complexity and Context Length? Yang Zhou, Hongyi Liu, Zhuoming Chen, Yuandong Tian, Beidi Chen
NeurIPS 2025 Kinetics: Rethinking Test-Time Scaling Law Ranajoy Sadhukhan, Zhuoming Chen, Haizhong Zheng, Beidi Chen
ICLR 2025 MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding Ranajoy Sadhukhan, Jian Chen, Zhuoming Chen, Vashisth Tiwari, Ruihang Lai, Jinyuan Shi, Ian En-Hsu Yen, Avner May, Tianqi Chen, Beidi Chen
ICLR 2025 MagicPIG: LSH Sampling for Efficient LLM Generation Zhuoming Chen, Ranajoy Sadhukhan, Zihao Ye, Yang Zhou, Jianyu Zhang, Niklas Nolte, Yuandong Tian, Matthijs Douze, Leon Bottou, Zhihao Jia, Beidi Chen
NeurIPSW 2024 CAT Pruning: Cluster-Aware Token Pruning for Text-to-Image Diffusion Models Xinle Cheng, Zhuoming Chen, Zhihao Jia
ICMLW 2024 MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar
NeurIPSW 2024 MagicPIG: LSH Sampling for Efficient LLM Generation Zhuoming Chen, Ranajoy Sadhukhan, Zihao Ye, Yang Zhou, Jianyu Zhang, Niklas Nolte, Yuandong Tian, Matthijs Douze, Leon Bottou, Zhihao Jia, Beidi Chen
NeurIPS 2024 Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar
NeurIPS 2024 SIRIUS : Contexual Sparisty with Correction for Efficient LLMs Yang Zhou, Zhuoming Chen, Zhaozhuo Xu, Xi Victoria Lin, Beidi Chen
NeurIPS 2024 Sequoia: Scalable and Robust Speculative Decoding Zhuoming Chen, Avner May, Ruslan Svirschevski, Yuhsun Huang, Max Ryabinin, Zhihao Jia, Beidi Chen
NeurIPSW 2024 Sirius: Contextual Sparsity with Correction for Efficient LLM Yang Zhou, Zhuoming Chen, Zhaozhuo Xu, Xi Victoria Lin, Beidi Chen
NeurIPS 2024 SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices Ruslan Svirschevski, Avner May, Zhuoming Chen, Beidi Chen, Zhihao Jia, Max Ryabinin
NeurIPS 2022 Quantized Training of Gradient Boosting Decision Trees Yu Shi, Guolin Ke, Zhuoming Chen, Shuxin Zheng, Tie-Yan Liu