Choi, Euntae

1 publication

NeurIPS 2025. NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache. Donghyun Son, Euntae Choi, Sungjoo Yoo.