Third-Order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima

Yu, Yaodong; Xu, Pan; Gu, Quanquan

Third-Order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima

NeurIPS 2018 pp. 4525-4535

/neurips/2018/yu2018neurips-thirdorder/

Abstract

We propose stochastic optimization algorithms that can find local minima faster than existing algorithms for nonconvex optimization problems, by exploiting the third-order smoothness to escape non-degenerate saddle points more efficiently. More specifically, the proposed algorithm only needs $\tilde{O}(\epsilon^{-10/3})$ stochastic gradient evaluations to converge to an approximate local minimum $\mathbf{x}$, which satisfies $\|\nabla f(\mathbf{x})\|_2\leq\epsilon$ and $\lambda_{\min}(\nabla^2 f(\mathbf{x}))\geq -\sqrt{\epsilon}$ in unconstrained stochastic optimization, where $\tilde{O}(\cdot)$ hides logarithm polynomial terms and constants. This improves upon the $\tilde{O}(\epsilon^{-7/2})$ gradient complexity achieved by the state-of-the-art stochastic local minima finding algorithms by a factor of $\tilde{O}(\epsilon^{-1/6})$. Experiments on two nonconvex optimization problems demonstrate the effectiveness of our algorithm and corroborate our theory.

PDF NeurIPS Semantic Scholar

Cite

Text

Yu et al. "Third-Order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima." Neural Information Processing Systems, 2018.

Markdown

[Yu et al. "Third-Order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima." Neural Information Processing Systems, 2018.](https://mlanthology.org/neurips/2018/yu2018neurips-thirdorder/)

BibTeX

@inproceedings{yu2018neurips-thirdorder,
  title     = {{Third-Order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima}},
  author    = {Yu, Yaodong and Xu, Pan and Gu, Quanquan},
  booktitle = {Neural Information Processing Systems},
  year      = {2018},
  pages     = {4525-4535},
  url       = {https://mlanthology.org/neurips/2018/yu2018neurips-thirdorder/}
}