Upasani, Shubhangi
1 publication
ICLRW 2025
LLMs Know What to Drop: Self-Attention Guided KV Cache Eviction for Efficient Long-Context Inference
Guangtao Wang, Shubhangi Upasani, Chen Wu, Darshan Gandhi, Jonathan Lingjie Li, Changran Hu, Bo Li, Urmish Thakker