Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

Abstract

Recent studies have illustrated that stochastic gradient Markov chain Monte Carlo techniques have strong potential in non-convex optimization, where local and global convergence guarantees can be shown under certain conditions. Building on this recent theory, we develop an asynchronous-parallel stochastic L-BFGS algorithm for non-convex optimization. The proposed algorithm is suitable for both distributed and shared-memory settings. We provide a formal theoretical analysis and show that the proposed method achieves an ergodic convergence rate of ${\cal O}(1/\sqrt{N})$ ($N$ being the total number of iterations) and can achieve a linear speedup under certain conditions. We perform several experiments on both synthetic and real datasets. The results support our theory and show that the proposed algorithm provides a significant speedup over the recently proposed synchronous distributed L-BFGS algorithm.
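To give a rough feel for the stochastic gradient MCMC view of non-convex optimization that the abstract refers to, the following is a minimal, self-contained sketch of a preconditioned SG-MCMC (Langevin-type) update on a toy objective. It is not the paper's asynchronous L-BFGS method: a simple diagonal (RMSProp-style) preconditioner stands in for the quasi-Newton metric, and the objective, step size, and temperature are made-up illustrative values.

import numpy as np

rng = np.random.default_rng(0)

def grad_f(x, batch):
    # Stochastic gradient of a toy non-convex objective
    # (quadratic plus a sinusoidal perturbation), averaged over a minibatch.
    return np.mean([(x - b) + 0.5 * np.sin(x - b) for b in batch], axis=0)

data = rng.normal(0.0, 1.0, size=(1000, 2))   # synthetic "dataset"
x = rng.normal(size=2)                         # current iterate
v = np.ones_like(x)                            # running second-moment estimate
eta, temp, eps = 1e-2, 1e-3, 1e-8              # step size, temperature, damping

for n in range(5000):
    batch = data[rng.integers(0, len(data), size=32)]
    g = grad_f(x, batch)
    v = 0.99 * v + 0.01 * g**2                 # diagonal preconditioner (RMSProp-like)
    precond = 1.0 / (np.sqrt(v) + eps)
    noise = rng.normal(size=x.shape)
    # Preconditioned SG-MCMC step: gradient drift plus tempered Gaussian noise.
    # Setting temp close to 0 makes this behave like preconditioned SGD.
    x = x - eta * precond * g + np.sqrt(2.0 * eta * temp * precond) * noise

print("approximate minimizer:", x)

The injected Gaussian noise is what lets such samplers escape shallow local minima and is the basis of the convergence guarantees cited in the abstract; the paper replaces the diagonal preconditioner above with an L-BFGS-style metric and runs the updates asynchronously across workers.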

Cite

Text

Simsekli et al. "Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization." International Conference on Machine Learning, 2018.

Markdown

[Simsekli et al. "Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization." International Conference on Machine Learning, 2018.](https://mlanthology.org/icml/2018/simsekli2018icml-asynchronous/)

BibTeX

@inproceedings{simsekli2018icml-asynchronous,
  title     = {{Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization}},
  author    = {Simsekli, Umut and Yildiz, Cagatay and Nguyen, Thanh Huy and Cemgil, Taylan and Richard, Gael},
  booktitle = {International Conference on Machine Learning},
  year      = {2018},
  pages     = {4674--4683},
  volume    = {80},
  url       = {https://mlanthology.org/icml/2018/simsekli2018icml-asynchronous/}
}