A Bregman Divergence View on the Difference-of-Convex Algorithm

Abstract

The difference of convex (DC) algorithm is a conceptually simple method for the minimization of (non)convex functions that are expressed as the difference of two convex functions. An attractive feature of the algorithm is that it maintains a global overestimator on the function and does not require a choice of step size at each iteration. By adopting a Bregman divergence point of view, we simplify and strengthen many existing non-asymptotic convergence guarantees for the DC algorithm. We further present several sufficient conditions that ensure a linear convergence rate, namely a new DC Polyak-Lojasiewicz condition, as well as a relative strong convexity assumption. Importantly, our conditions do not require smoothness of the objective function. We illustrate our results on a family of minimization problems involving the quantum relative entropy, with applications in quantum information theory.

Cite

Text

Faust et al. "A Bregman Divergence View on the Difference-of-Convex Algorithm." Artificial Intelligence and Statistics, 2023.

Markdown

[Faust et al. "A Bregman Divergence View on the Difference-of-Convex Algorithm." Artificial Intelligence and Statistics, 2023.](https://mlanthology.org/aistats/2023/faust2023aistats-bregman/)

BibTeX

@inproceedings{faust2023aistats-bregman,
  title     = {{A Bregman Divergence View on the Difference-of-Convex Algorithm}},
  author    = {Faust, Oisin and Fawzi, Hamza and Saunderson, James},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2023},
  pages     = {3427-3439},
  volume    = {206},
  url       = {https://mlanthology.org/aistats/2023/faust2023aistats-bregman/}
}