Khaled, Ahmed

15 publications

NeurIPS 2025 Understanding Outer Optimizers in Local SGD: Learning Rates, Momentum, and Acceleration Ahmed Khaled, Satyen Kale, Arthur Douillard, Chi Jin, Rob Fergus, Manzil Zaheer
NeurIPS 2024 Directional Smoothness and Gradient Methods: Convergence and Adaptivity Aaron Mishkin, Ahmed Khaled, Yuanhao Wang, Aaron Defazio, Robert M. Gower
NeurIPS 2024 Don't Compress Gradients in Random Reshuffling: Compress Gradient Differences Abdurakhmon Sadiev, Grigory Malinovsky, Eduard Gorbunov, Igor Sokolov, Ahmed Khaled, Konstantin Burlachenko, Peter Richtárik
NeurIPS 2024 The Road Less Scheduled Aaron Defazio, Xingyu Yang, Harsh Mehta, Konstantin Mishchenko, Ahmed Khaled, Ashok Cutkosky
ICML 2024 Tuning-Free Stochastic Optimization Ahmed Khaled, Chi Jin
NeurIPSW 2023 A Novel Analysis of Gradient Descent Under Directional Smoothness Aaron Mishkin, Ahmed Khaled, Aaron Defazio, Robert M. Gower
TMLR 2023 Better Theory for SGD in the Nonconvex World Ahmed Khaled, Peter Richtárik
NeurIPS 2023 DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method Ahmed Khaled, Konstantin Mishchenko, Chi Jin
ICLR 2023 Faster Federated Optimization Under Second-Order Similarity Ahmed Khaled, Chi Jin
ICMLW 2023 Federated Optimization Algorithms with Random Reshuffling and Gradient Compression Abdurakhmon Sadiev, Grigory Malinovsky, Eduard Gorbunov, Igor Sokolov, Ahmed Khaled, Konstantin Pavlovich Burlachenko, Peter Richtárik
AISTATS 2022 FLIX: A Simple and Communication-Efficient Alternative to Local Methods in Federated Learning Elnur Gasanov, Ahmed Khaled, Samuel Horváth, Peter Richtárik
ICML 2022 Proximal and Federated Random Reshuffling Konstantin Mishchenko, Ahmed Khaled, Peter Richtárik
NeurIPSW 2021 FedMix: A Simple and Communication-Efficient Alternative to Local Methods in Federated Learning Elnur Gasanov, Ahmed Khaled, Samuel Horváth, Peter Richtárik
NeurIPS 2020 Random Reshuffling: Simple Analysis with Vast Improvements Konstantin Mishchenko, Ahmed Khaled, Peter Richtárik
AISTATS 2020 Tighter Theory for Local SGD on Identical and Heterogeneous Data Ahmed Khaled, Konstantin Mishchenko, Peter Richtárik