ML Anthology
Authors
Search
About
Kim, Minsub
1 publications
ICLR
2024
LUT-GEMM: Quantized Matrix Multiplication Based on LUTs for Efficient Inference in Large-Scale Generative Language Models
Gunho Park
,
Baeseong Park
,
Minsub Kim
,
Sungjae Lee
,
Jeonghoon Kim
,
Beomseok Kwon
,
Se Jung Kwon
,
Byeongwook Kim
,
Youngjoo Lee
,
Dongsoo Lee