ML Anthology
Authors
Search
About
Wang, Pei-Shuo
2 publications
ICLR
2025
Palu: KV-Cache Compression with Low-Rank Projection
Chi-Chih Chang
,
Wei-Cheng Lin
,
Chien-Yu Lin
,
Chong-Yan Chen
,
Yu-Fang Hu
,
Pei-Shuo Wang
,
Ning-Chi Huang
,
Luis Ceze
,
Mohamed S. Abdelfattah
,
Kai-Chiang Wu
NeurIPS
2025
Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding
Pei-Shuo Wang
,
Jian-Jia Chen
,
Chun-Che Yang
,
Chi-Chih Chang
,
Ning-Chi Huang
,
Mohamed S. Abdelfattah
,
Kai-Chiang Wu