Elangovan, Reena

1 publications

TMLR 2025 LO-BCQ: Locally Optimal Block Clustered Quantization for 4-Bit (W4A4) LLM Inference Reena Elangovan, Charbel Sakr, Anand Raghunathan, Brucek Khailany