Fedorov, Igor

7 publications

ICLR 2025 SpinQuant: LLM Quantization with Learned Rotations Zechun Liu, Changsheng Zhao, Igor Fedorov, Bilge Soran, Dhruv Choudhary, Raghuraman Krishnamoorthi, Vikas Chandra, Yuandong Tian, Tijmen Blankevoort
TMLR 2025 ∇QDARTS: Quantization as an Elastic Dimension to Differentiable NAS Payman Behnam, Uday Kamal, Sanjana Vijay Ganesh, Zhaoyi Li, Michael Andrew Jurado, Alind Khare, Igor Fedorov, Gaowen Liu, Alexey Tumanov
ECCV 2024 DεpS: Delayed ε-Shrinking for Faster Once-for-All Training Aditya Annavajjala, Alind Khare, Animesh Agrawal, Igor Fedorov, Hugo M Latapie, Myungjin Lee, Alexey Tumanov
ICML 2024 MobileLLM: Optimizing Sub-Billion Parameter Language Models for On-Device Use Cases Zechun Liu, Changsheng Zhao, Forrest Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra
ICLR 2023 Efficient Edge Inference by Selective Query Anil Kag, Igor Fedorov, Aditya Gangrade, Paul Whatmough, Venkatesh Saligrama
NeurIPS 2022 UDC: Unified DNAS for Compressible TinyML Models for Neural Processing Units Igor Fedorov, Ramon Matas, Hokchhay Tann, Chuteng Zhou, Matthew Mattina, Paul Whatmough
NeurIPS 2019 SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers Igor Fedorov, Ryan P. Adams, Matthew Mattina, Paul Whatmough