Zeng, Zhanpeng

10 publications

NeurIPS 2025 Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings Qiong Wu, Wenhao Lin, Yiyi Zhou, Weihao Ye, Zhanpeng Zeng, Xiaoshuai Sun, Rongrong Ji
NeurIPS 2024 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization Qihao Liu, Zhanpeng Zeng, Ju He, Qihang Yu, Xiaohui Shen, Liang-Chieh Chen
ICML 2024 FrameQuant: Flexible Low-Bit Quantization for Transformers Harshavardhan Adepu, Zhanpeng Zeng, Li Zhang, Vikas Singh
ICML 2024 IM-Unpack: Training and Inference with Arbitrarily Low Precision Integers Zhanpeng Zeng, Karthikeyan Sankaralingam, Vikas Singh
ICML 2023 Controlled Differential Equations on Long Sequences via Non-Standard Wavelets Sourav Pal, Zhanpeng Zeng, Sathya N. Ravi, Vikas Singh
ICML 2023 LookupFFN: Making Transformers Compute-Lite for CPU Inference Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh
NeurIPS 2023 VCC: Scaling Transformers to 128k Tokens or More by Prioritizing Important Tokens Zhanpeng Zeng, Cole Hawkins, Mingyi Hong, Aston Zhang, Nikolaos Pappas, Vikas Singh, Shuai Zheng
ICML 2022 Multi Resolution Analysis (MRA) for Approximate Self-Attention Zhanpeng Zeng, Sourav Pal, Jeffery Kline, Glenn M Fung, Vikas Singh
AAAI 2021 Nyströmformer: A Nyström-Based Algorithm for Approximating Self-Attention Yunyang Xiong, Zhanpeng Zeng, Rudrasis Chakraborty, Mingxing Tan, Glenn Fung, Yin Li, Vikas Singh
ICML 2021 You Only Sample (Almost) Once: Linear Cost Self-Attention via Bernoulli Sampling Zhanpeng Zeng, Yunyang Xiong, Sathya Ravi, Shailesh Acharya, Glenn M Fung, Vikas Singh