Stochastic Coordinate Descent Methods for Regularized Smooth and Nonsmooth Losses

Abstract

Stochastic Coordinate Descent (SCD) methods are among the first optimization schemes suggested for efficiently solving large scale problems. However, until now, there exists a gap between the convergence rate analysis and practical SCD algorithms for general smooth losses and there is no primal SCD algorithm for nonsmooth losses. In this paper, we discuss these issues using the recently developed structural optimization techniques. In particular, we first present a principled and practical SCD algorithm for regularized smooth losses, in which the one-variable subproblem is solved using the proximal gradient method and the adaptive componentwise Lipschitz constant is obtained employing the line search strategy. When the loss is nonsmooth, we present a novel SCD algorithm, in which the one-variable subproblem is solved using the dual averaging method. We show that our algorithms exploit the regularization structure and achieve several optimal convergence rates that are standard in the literature. The experiments demonstrate the expected efficiency of our SCD algorithms in both smooth and nonsmooth cases.

Cite

Text

Tao et al. "Stochastic Coordinate Descent Methods for Regularized Smooth and Nonsmooth Losses." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2012. doi:10.1007/978-3-642-33460-3_40

Markdown

[Tao et al. "Stochastic Coordinate Descent Methods for Regularized Smooth and Nonsmooth Losses." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2012.](https://mlanthology.org/ecmlpkdd/2012/tao2012ecmlpkdd-stochastic/) doi:10.1007/978-3-642-33460-3_40

BibTeX

@inproceedings{tao2012ecmlpkdd-stochastic,
  title     = {{Stochastic Coordinate Descent Methods for Regularized Smooth and Nonsmooth Losses}},
  author    = {Tao, Qing and Kong, Kang and Chu, Dejun and Wu, Gao-wei},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2012},
  pages     = {537-552},
  doi       = {10.1007/978-3-642-33460-3_40},
  url       = {https://mlanthology.org/ecmlpkdd/2012/tao2012ecmlpkdd-stochastic/}
}