ML Anthology
Park, Gunho
2 publications
NeurIPS 2025
CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs
Gunho Park, Jeongin Bae, Byeongwook Kim, Baeseong Park, Jiwon Ryu, Hoseung Kim, Se Jung Kwon, Dongsoo Lee
ICLR 2024
LUT-GEMM: Quantized Matrix Multiplication Based on LUTs for Efficient Inference in Large-Scale Generative Language Models
Gunho Park, Baeseong Park, Minsub Kim, Sungjae Lee, Jeonghoon Kim, Beomseok Kwon, Se Jung Kwon, Byeongwook Kim, Youngjoo Lee, Dongsoo Lee