Kuzmin, Andrey

6 publications

ICMLW 2024 GPTVQ: The Blessing of Dimensionality for LLM Quantization Mart Van Baalen, Andrey Kuzmin, Markus Nagel, Peter Couperus, Artem Bolshakov, Cedric Bastoul, Eric Mahurin, Tijmen Blankevoort, Paul Whatmough
NeurIPS 2023 Pruning vs Quantization: Which Is Better? Andrey Kuzmin, Markus Nagel, Mart van Baalen, Arash Behboodi, Tijmen Blankevoort
CVPRW 2022 Cyclical Pruning for Sparse Neural Networks Suraj Srinivas, Andrey Kuzmin, Markus Nagel, Mart van Baalen, Andrii Skliar, Tijmen Blankevoort
NeurIPS 2022 FP8 Quantization: The Power of the Exponent Andrey Kuzmin, Mart van Baalen, Yuwei Ren, Markus Nagel, Jorn Peters, Tijmen Blankevoort
CVPRW 2022 Simulated Quantization, Real Power Savings Mart van Baalen, Brian Kahne, Eric Mahurin, Andrey Kuzmin, Andrii Skliar, Markus Nagel, Tijmen Blankevoort
ICCVW 2017 Set2Model Networks: Learning Discriminatively to Learn Generative Models Andrey Kuzmin, Alexander Vakhitov, Victor S. Lempitsky