Liuyue

1 publication

NeurIPS 2025
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference
Xiang Liu, Zhenheng Tang, Peijie Dong, Zeyu Li, Liuyue, Bo Li, Xuming Hu, Xiaowen Chu