Whatmough, Paul

8 publications

ICMLW 2024 GPTVQ: The Blessing of Dimensionality for LLM Quantization Mart Van Baalen, Andrey Kuzmin, Markus Nagel, Peter Couperus, Artem Bolshakov, Cedric Bastoul, Eric Mahurin, Tijmen Blankevoort, Paul Whatmough
ICMLW 2024 Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Rafael Esteves, Shreya Kadambi, Shubhankar Borse, Paul Whatmough, Risheek Garrepalli, Mart Van Baalen, Harris Teague, Markus Nagel
NeurIPS 2024 Sparse High Rank Adapters Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Shreya Kadambi, Rafael Esteves, Shubhankar Borse, Paul Whatmough, Risheek Garrepalli, Mart Van Baalen, Harris Teague, Markus Nagel
ICLR 2023 Efficient Edge Inference by Selective Query Anil Kag, Igor Fedorov, Aditya Gangrade, Paul Whatmough, Venkatesh Saligrama
NeurIPS 2022 UDC: Unified DNAS for Compressible TinyML Models for Neural Processing Units Igor Fedorov, Ramon Matas, Hokchhay Tann, Chuteng Zhou, Matthew Mattina, Paul Whatmough
ICML 2021 Debiasing Model Updates for Improving Personalized Federated Training Durmus Alp Emre Acar, Yue Zhao, Ruizhao Zhu, Ramon Matas, Matthew Mattina, Paul Whatmough, Venkatesh Saligrama
ICLR 2021 Federated Learning Based on Dynamic Regularization Durmus Alp Emre Acar, Yue Zhao, Ramon Matas, Matthew Mattina, Paul Whatmough, Venkatesh Saligrama
NeurIPS 2019 SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers Igor Fedorov, Ryan P. Adams, Matthew Mattina, Paul Whatmough