ML Anthology
Hooper, Coleman Richard Charles
4 publications
NeurIPS
2025
Multipole Attention for Efficient Long Context Reasoning
Coleman Richard Charles Hooper, Sebastian Zhao, Luca Manolache, Sehoon Kim, Michael W. Mahoney, Sophia Shao, Kurt Keutzer, Amir Gholami
ICML
2025
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache
Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Richard Charles Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kurt Keutzer, Amir Gholami
ICMLW
2024
Learned Best-Effort LLM Serving
Siddharth Jha, Coleman Richard Charles Hooper, Xiaoxuan Liu, Sehoon Kim, Kurt Keutzer
ICML
2024
SqueezeLLM: Dense-and-Sparse Quantization
Sehoon Kim, Coleman Richard Charles Hooper, Amir Gholami, Zhen Dong, Xiuyu Li, Sheng Shen, Michael W. Mahoney, Kurt Keutzer