Khailany, Brucek

6 publications

ICLR 2026. QuRL: Low-Precision Reinforcement Learning for Efficient Reasoning. Yuhang Li, Reena Elangovan, Xin Dong, Priyadarshini Panda, Brucek Khailany
ICLR 2026. ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models. Akshat Ramachandran, Marina Neseem, Charbel Sakr, Rangharajan Venkatesan, Brucek Khailany, Tushar Krishna
TMLR 2025. LO-BCQ: Locally Optimal Block Clustered Quantization for 4-Bit (W4A4) LLM Inference. Reena Elangovan, Charbel Sakr, Anand Raghunathan, Brucek Khailany
AAAI 2025. VerilogCoder: Autonomous Verilog Coding Agents with Graph-Based Planning and Abstract Syntax Tree (AST)-Based Waveform Tracing Tool. Chia-Tung Ho, Haoxing Ren, Brucek Khailany
NeurIPS 2024. ESPACE: Dimensionality Reduction of Activations for Model Compression. Charbel Sakr, Brucek Khailany
ICML 2022. Optimal Clipping and Magnitude-Aware Differentiation for Improved Quantization-Aware Training. Charbel Sakr, Steve Dai, Rangha Venkatesan, Brian Zimmer, William Dally, Brucek Khailany