Du, Cunxiao

10 publications

ICML 2025 BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Yunlong Hou, Fengzhuo Zhang, Cunxiao Du, Xuan Zhang, Jiachun Pan, Tianyu Pang, Chao Du, Vincent Tan, Zhuoran Yang
ICLR 2025 Efficient Inference for Large Language Model-Based Generative Recommendation Xinyu Lin, Chaoqun Yang, Wenjie Wang, Yongqi Li, Cunxiao Du, Fuli Feng, See-Kiong Ng, Tat-Seng Chua
TMLR 2025 LightTransfer: Your Long-Context LLM Is Secretly a Hybrid Model with Effortless Adaptation Xuan Zhang, Fengzhuo Zhang, Cunxiao Du, Chao Du, Tianyu Pang, Wei Gao, Min Lin
ICLRW 2025 LightTransfer: Your Long-Context LLM Is Secretly a Hybrid Model with Effortless Adaptation Xuan Zhang, Fengzhuo Zhang, Cunxiao Du, Chao Du, Tianyu Pang, Wei Gao, Min Lin
ICLR 2025 SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration Heming Xia, Yongqi Li, Jun Zhang, Cunxiao Du, Wenjie Li
ICLR 2025 When Attention Sink Emerges in Language Models: An Empirical View Xiangming Gu, Tianyu Pang, Chao Du, Qian Liu, Fengzhuo Zhang, Cunxiao Du, Ye Wang, Min Lin
TMLR 2025 When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Haonan Wang, Qian Liu, Chao Du, Tongyao Zhu, Cunxiao Du, Kenji Kawaguchi, Tianyu Pang
ICML 2024 GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding Cunxiao Du, Jing Jiang, Xu Yuanchen, Jiawei Wu, Sicheng Yu, Yongqi Li, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu, Yang You
ICML 2021 Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation Cunxiao Du, Zhaopeng Tu, Jing Jiang
AAAI 2019 Explicit Interaction Model Towards Text Classification Cunxiao Du, Zhaozheng Chen, Fuli Feng, Lei Zhu, Tian Gan, Liqiang Nie