ML Anthology
Xie, Victor
1 publication
NeurIPS 2023
Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava