An Efficient Primal Dual Prox Method for Non-Smooth Optimization

Abstract

We study the non-smooth optimization problems in machine learning, where both the loss function and the regularizer are non-smooth functions. Previous studies on efficient empirical loss minimization assume either a smooth loss function or a strongly convex regularizer, making them unsuitable for non-smooth optimization. We develop a simple yet efficient method for a family of non-smooth optimization problems where the dual form of the loss function is bilinear in primal and dual variables. We cast a non-smooth optimization problem into a minimax optimization problem, and develop a primal dual prox method that solves the minimax optimization problem at a rate of $O(1/T)$ O ( 1 / T ) assuming that the proximal step can be efficiently solved, significantly faster than a standard subgradient descent method that has an $O(1/\sqrt{T})$ O ( 1 / T ) convergence rate. Our empirical studies verify the efficiency of the proposed method for various non-smooth optimization problems that arise ubiquitously in machine learning by comparing it to the state-of-the-art first order methods.

Cite

Text

Yang et al. "An Efficient Primal Dual Prox Method for Non-Smooth Optimization." Machine Learning, 2015. doi:10.1007/S10994-014-5436-1

Markdown

[Yang et al. "An Efficient Primal Dual Prox Method for Non-Smooth Optimization." Machine Learning, 2015.](https://mlanthology.org/mlj/2015/yang2015mlj-efficient/) doi:10.1007/S10994-014-5436-1

BibTeX

@article{yang2015mlj-efficient,
  title     = {{An Efficient Primal Dual Prox Method for Non-Smooth Optimization}},
  author    = {Yang, Tianbao and Mahdavi, Mehrdad and Jin, Rong and Zhu, Shenghuo},
  journal   = {Machine Learning},
  year      = {2015},
  pages     = {369-406},
  doi       = {10.1007/S10994-014-5436-1},
  volume    = {98},
  url       = {https://mlanthology.org/mlj/2015/yang2015mlj-efficient/}
}