ML Anthology
Authors
Search
About
Xia, Heming
1 publications
ICLR
2025
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
Heming Xia
,
Yongqi Li
,
Jun Zhang
,
Cunxiao Du
,
Wenjie Li