Glocal Smoothness: Line Search and Adaptive Sizes Can Help in Theory Too!
Abstract
Iteration complexities for optimizing smooth functions with first-order algorithms are typically stated in terms of a global Lipschitz constant of the gradient, and near-optimal results are then achieved using fixed step sizes. But many objective functions that arise in practice have regions with small Lipschitz constants where larger step sizes can be used. Many local Lipschitz assumptions have been proposed, which have led to results showing that adaptive step sizes and/or line searches yield improved convergence rates over fixed step sizes. However, these faster rates tend to depend on the iterates of the algorithm, which makes it difficult to compare the iteration complexities of different methods. We consider a simple characterization of global and local ("glocal") smoothness that only depends on properties of the function. This allows upper bounds on iteration complexities in terms of iterate-independent constants and enables us to compare iteration complexities between algorithms. Under this assumption it is straightforward to show the advantages of line searches over fixed step sizes and that, in some settings, gradient descent with line search has a better iteration complexity than accelerated methods with fixed step sizes. We further show that glocal smoothness can lead to improved complexities for the Polyak and AdGD step sizes, as well other algorithms including coordinate optimization, stochastic gradient methods, accelerated gradient methods, and non-linear conjugate gradient methods.
Cite
Text
Fox et al. "Glocal Smoothness: Line Search and Adaptive Sizes Can Help in Theory Too!." Transactions on Machine Learning Research, 2026.Markdown
[Fox et al. "Glocal Smoothness: Line Search and Adaptive Sizes Can Help in Theory Too!." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/fox2026tmlr-glocal/)BibTeX
@article{fox2026tmlr-glocal,
title = {{Glocal Smoothness: Line Search and Adaptive Sizes Can Help in Theory Too!}},
author = {Fox, Curtis and Mishkin, Aaron and Vaswani, Sharan and Schmidt, Mark},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/fox2026tmlr-glocal/}
}