Alistarh, Dan
75 publications
ICLR
2025
The Journey Matters: Average Parameter Count over Pre-Training Unifies Sparse and Dense Scaling Laws
NeurIPS
2024
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence
NeurIPS
2024
The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information
ICML
2023
SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge
NeurIPS
2022
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
AAAI
2021
Asynchronous Optimization Methods for Efficient Training of Deep Neural Networks with Guarantees
AAAI
2021
Elastic Consistency: A Practical Consistency Model for Distributed Stochastic Gradient Descent