ML Anthology
Nikita, Surkov
1 publication
ICML 2025
Cache Me if You Must: Adaptive Key-Value Quantization for Large Language Models
Alina Shutova, Vladimir Malinovskii, Vage Egiazarian, Denis Kuznedelev, Denis Mazur, Surkov Nikita, Ivan Ermakov, Dan Alistarh