ML Anthology
Yazdani Aminabadi, Reza
1 publication
ICML 2023
Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases
Xiaoxia Wu, Cheng Li, Reza Yazdani Aminabadi, Zhewei Yao, Yuxiong He