Gandhi, Darshan

1 publications

ICLRW 2025 LLMs Know What to Drop: Self-Attention Guided KV Cache Eviction for Efficient Long-Context Inference Guangtao Wang, Shubhangi Upasani, Chen Wu, Darshan Gandhi, Jonathan Lingjie Li, Changran Hu, Bo Li, Urmish Thakker