Hooper, Coleman Richard Charles

4 publications

NeurIPS 2025 Multipole Attention for Efficient Long Context Reasoning Coleman Richard Charles Hooper, Sebastian Zhao, Luca Manolache, Sehoon Kim, Michael W. Mahoney, Sophia Shao, Kurt Keutzer, Amir Gholami
ICML 2025 QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Richard Charles Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kurt Keutzer, Amir Gholami
ICMLW 2024 Learned Best-Effort LLM Serving Siddharth Jha, Coleman Richard Charles Hooper, Xiaoxuan Liu, Sehoon Kim, Kurt Keutzer
ICML 2024 SqueezeLLM: Dense-and-Sparse Quantization Sehoon Kim, Coleman Richard Charles Hooper, Amir Gholami, Zhen Dong, Xiuyu Li, Sheng Shen, Michael W. Mahoney, Kurt Keutzer