ML Anthology
Choi, Euntae
1 publication
NeurIPS 2025
NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache
Donghyun Son, Euntae Choi, Sungjoo Yoo