Fast Greedy C-Bound Minimization with Guarantees

Abstract

The $\mathcal {C}$ C -bound is a tight bound on the true risk of a majority vote classifier that relies on the individual quality and pairwise disagreement of the voters and provides PAC-Bayesian generalization guarantees. Based on this bound, MinCq is a classification algorithm that returns a dense distribution on a finite set of voters by minimizing it. Introduced later and inspired by boosting, CqBoost uses a column generation approach to build a sparse $\mathcal {C}$ C -bound optimal distribution on a possibly infinite set of voters. However, both approaches have a high computational learning time because they minimize the $\mathcal {C}$ C -bound by solving a quadratic program. Yet, one advantage of CqBoost is its experimental ability to provide sparse solutions. In this work, we address the problem of accelerating the $\mathcal {C}$ C -bound minimization process while keeping the sparsity of the solution and without losing accuracy. We present CB-Boost, a computationally efficient classification algorithm relying on a greedy–boosting-based– $\mathcal {C}$ C -bound optimization. An in-depth analysis proves the optimality of the greedy minimization process and quantifies the decrease of the $\mathcal {C}$ C -bound operated by the algorithm. Generalization guarantees are then drawn based on already existing PAC-Bayesian theorems. In addition, we experimentally evaluate the relevance of CB-Boost in terms of the three main properties we expect about it: accuracy, sparsity, and computational efficiency compared to MinCq, CqBoost, Adaboost and other ensemble methods. As observed in these experiments, CB-Boost not only achieves results comparable to the state of the art, but also provides $\mathcal {C}$ C -bound sub-optimal weights with very few computational demand while keeping the sparsity property of CqBoost.

Cite

Text

Bauvin et al. "Fast Greedy C-Bound Minimization with Guarantees." Machine Learning, 2020. doi:10.1007/S10994-020-05902-7

Markdown

[Bauvin et al. "Fast Greedy C-Bound Minimization with Guarantees." Machine Learning, 2020.](https://mlanthology.org/mlj/2020/bauvin2020mlj-fast/) doi:10.1007/S10994-020-05902-7

BibTeX

@article{bauvin2020mlj-fast,
  title     = {{Fast Greedy C-Bound Minimization with Guarantees}},
  author    = {Bauvin, Baptiste and Capponi, Cécile and Roy, Jean-Francis and Laviolette, François},
  journal   = {Machine Learning},
  year      = {2020},
  pages     = {1945-1986},
  doi       = {10.1007/S10994-020-05902-7},
  volume    = {109},
  url       = {https://mlanthology.org/mlj/2020/bauvin2020mlj-fast/}
}