Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization

Abstract

Recent studies have shown that many nonconvex machine learning problems satisfy a generalized-smooth condition that extends beyond traditional smooth nonconvex optimization. However, the existing algorithms are not fully adapted to such generalized-smooth nonconvex geometry and encounter significant technical limitations on their convergence analysis. In this work, we first analyze the convergence of adaptively normalized gradient descent under function geometries characterized by generalized-smoothness and the generalized PL condition, revealing the advantage of adaptive gradient normalization. Our results provide theoretical insights into adaptive normalization across various scenarios. For stochastic generalized-smooth nonconvex optimization, we propose the Independent-Adaptively Normalized Stochastic Gradient Descent algorithm, which leverages adaptive gradient normalization, independent sampling, and gradient clipping to achieve an $\mathcal{O}(\epsilon^{-4})$ sample complexity under relaxed noise assumptions. Experiments on large-scale nonconvex generalized-smooth problems demonstrate the fast convergence of our algorithm.

Cite

Text

Yang et al. "Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization." Transactions on Machine Learning Research, 2025.

Markdown

[Yang et al. "Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/yang2025tmlr-adaptive/)

BibTeX

@article{yang2025tmlr-adaptive,
  title     = {{Adaptive Gradient Normalization and Independent Sampling for (Stochastic) Generalized-Smooth Optimization}},
  author    = {Yang, Yufeng and Tripp, Erin E. and Sun, Yifan and Zou, Shaofeng and Zhou, Yi},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/yang2025tmlr-adaptive/}
}