ML Anthology
Authors
Search
About
Yang, Chun-Che
1 publications
NeurIPS
2025
Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding
Pei-Shuo Wang
,
Jian-Jia Chen
,
Chun-Che Yang
,
Chi-Chih Chang
,
Ning-Chi Huang
,
Mohamed S. Abdelfattah
,
Kai-Chiang Wu