Armijo Line-Search Can Make (Stochastic) Gradient Descent Provably Faster
Abstract
Armijo line-search (Armijo-LS) is a standard method to set the step-size for gradient descent (GD). For smooth functions, Armijo-LS alleviates the need to know the global smoothness constant $L$ and adapts to the “local” smoothness, enabling GD to converge faster. Existing theoretical analyses show that GD with Armijo-LS ($\texttt{GD-LS}$) can result in constant factor improvements over GD with a $1/L$ step-size (denoted as $\texttt{GD(1/L)}$). We strengthen these results and show that if the objective function satisfies a certain non-uniform smoothness condition, $\texttt{GD-LS}$ can result in a faster convergence rate than $\texttt{GD(1/L)}$. In particular, we prove that for convex objectives corresponding to logistic regression and multi-class classification, $\texttt{GD-LS}$ can converge to the optimum at a linear rate, and hence improves over the sublinear convergence of $\texttt{GD(1/L)}$. Furthermore, for non-convex objectives satisfying gradient domination (e.g., those corresponding to the softmax policy gradient in RL or generalized linear models with a logistic link function), $\texttt{GD-LS}$ can match the fast convergence of algorithms tailored for these specific settings. Finally, we analyze the convergence of stochastic GD with a stochastic line-search on convex losses under the interpolation assumption.
Cite
Text
Vaswani and Babanezhad Harikandeh. "Armijo Line-Search Can Make (Stochastic) Gradient Descent Provably Faster." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Vaswani and Babanezhad Harikandeh. "Armijo Line-Search Can Make (Stochastic) Gradient Descent Provably Faster." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/vaswani2025icml-armijo/)BibTeX
@inproceedings{vaswani2025icml-armijo,
title = {{Armijo Line-Search Can Make (Stochastic) Gradient Descent Provably Faster}},
author = {Vaswani, Sharan and Babanezhad Harikandeh, Reza},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {61035-61070},
volume = {267},
url = {https://mlanthology.org/icml/2025/vaswani2025icml-armijo/}
}