ML Anthology
Authors
Search
About
Xu, Ruge
1 publications
NeurIPS
2025
CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs
Zhiyuan Ning
,
Jiawei Shao
,
Ruge Xu
,
Xinfei Guo
,
Jun Zhang
,
Chi Zhang
,
Xuelong Li