ML Anthology
Authors
Search
About
Kwon, Beomseok
2 publications
ICLR
2024
LUT-GEMM: Quantized Matrix Multiplication Based on LUTs for Efficient Inference in Large-Scale Generative Language Models
Gunho Park
,
Baeseong Park
,
Minsub Kim
,
Sungjae Lee
,
Jeonghoon Kim
,
Beomseok Kwon
,
Se Jung Kwon
,
Byeongwook Kim
,
Youngjoo Lee
,
Dongsoo Lee
ICLR
2024
Rethinking Channel Dimensions to Isolate Outliers for Low-Bit Weight Quantization of Large Language Models
Jung Hwan Heo
,
Jeonghoon Kim
,
Beomseok Kwon
,
Byeongwook Kim
,
Se Jung Kwon
,
Dongsoo Lee