Nikita, Surkov

1 publications

ICML 2025 Cache Me if You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova, Vladimir Malinovskii, Vage Egiazarian, Denis Kuznedelev, Denis Mazur, Surkov Nikita, Ivan Ermakov, Dan Alistarh