Tu, Dezhan

1 publications

ICLR 2025 VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration Dezhan Tu, Danylo Vashchilenko, Yuzhe Lu, Panpan Xu