Kumbong, Hermann

6 publications

CVPR 2025 HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation Hermann Kumbong, Xian Liu, Tsung-Yi Lin, Ming-Yu Liu, Xihui Liu, Ziwei Liu, Daniel Y. Fu, Christopher Re, David W. Romero
ICML 2025 LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs Under 2 Bits Zikai Zhou, Qizheng Zhang, Hermann Kumbong, Kunle Olukotun
ICLR 2024 FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores Daniel Y Fu, Hermann Kumbong, Eric Nguyen, Christopher Re
ICLR 2024 The Hedgehog & the Porcupine: Expressive Linear Attentions with SoftMax Mimicry Michael Zhang, Kush Bhatia, Hermann Kumbong, Christopher Re
ICMLW 2023 GPT-Zip: Deep Compression of Finetuned Large Language Models Berivan Isik, Hermann Kumbong, Wanyi Ning, Xiaozhe Yao, Sanmi Koyejo, Ce Zhang
NeurIPS 2023 Laughing Hyena Distillery: Extracting Compact Recurrences from Convolutions Stefano Massaroli, Michael Poli, Dan Fu, Hermann Kumbong, Rom Parnichkun, David Romero, Aman Timalsina, Quinn McIntyre, Beidi Chen, Atri Rudra, Ce Zhang, Christopher RĂ©, Stefano Ermon, Yoshua Bengio