ML Anthology
Authors
Search
About
Lv, Junlin
2 publications
ICLR
2026
DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference
Yuan Feng
,
Haoyu Guo
,
Junlin Lv
,
S Kevin Zhou
,
Xike Xie
NeurIPS
2025
Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference
Yuan Feng
,
Junlin Lv
,
Yukun Cao
,
Xike Xie
,
S Kevin Zhou