PAC-Bayesian AUC Classification and Scoring

Abstract

We develop a scoring and classification procedure based on the PAC-Bayesian approach and the AUC (Area Under Curve) criterion. We focus initially on the class of linear score functions. We derive PAC-Bayesian non-asymptotic bounds for two types of prior for the score parameters: a Gaussian prior, and a spike-and-slab prior; the latter makes it possible to perform feature selection. One important advantage of our approach is that it is amenable to powerful Bayesian computational tools. We derive in particular a Sequential Monte Carlo algorithm, as an efficient method which may be used as a gold standard, and an Expectation-Propagation algorithm, as a much faster but approximate method. We also extend our method to a class of non-linear score functions, essentially leading to a nonparametric procedure, by considering a Gaussian process prior.

Cite

Text

Ridgway et al. "PAC-Bayesian AUC Classification and Scoring." Neural Information Processing Systems, 2014.

Markdown

[Ridgway et al. "PAC-Bayesian AUC Classification and Scoring." Neural Information Processing Systems, 2014.](https://mlanthology.org/neurips/2014/ridgway2014neurips-pacbayesian/)

BibTeX

@inproceedings{ridgway2014neurips-pacbayesian,
  title     = {{PAC-Bayesian AUC Classification and Scoring}},
  author    = {Ridgway, James and Alquier, Pierre and Chopin, Nicolas and Liang, Feng},
  booktitle = {Neural Information Processing Systems},
  year      = {2014},
  pages     = {658-666},
  url       = {https://mlanthology.org/neurips/2014/ridgway2014neurips-pacbayesian/}
}