ML Anthology
Authors
Search
About
Liuyue
1 publications
NeurIPS
2025
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
Xiang Liu
,
Zhenheng Tang
,
Peijie Dong
,
Zeyu Li
,
Liuyue
,
Bo Li
,
Xuming Hu
,
Xiaowen Chu