ML Anthology
Authors
Search
About
Tu, Dezhan
1 publications
ICLR
2025
VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration
Dezhan Tu
,
Danylo Vashchilenko
,
Yuzhe Lu
,
Panpan Xu