Predictive Multiplicity in Classification

Abstract

Prediction problems often admit competing models that perform almost equally well. This effect challenges key assumptions in machine learning when competing models assign conflicting predictions. In this paper, we define predictive multiplicity as the ability of a prediction problem to admit competing models with conflicting predictions. We introduce measures to evaluate the severity of predictive multiplicity, and develop integer programming tools to compute these measures exactly for linear classification problems. We apply our tools to measure predictive multiplicity in recidivism prediction problems. Our results show that real-world datasets may admit competing models that assign wildly conflicting predictions, and motivate the need to report predictive multiplicity in model development.

Cite

Text

Marx et al. "Predictive Multiplicity in Classification." International Conference on Machine Learning, 2020.

Markdown

[Marx et al. "Predictive Multiplicity in Classification." International Conference on Machine Learning, 2020.](https://mlanthology.org/icml/2020/marx2020icml-predictive/)

BibTeX

@inproceedings{marx2020icml-predictive,
  title     = {{Predictive Multiplicity in Classification}},
  author    = {Marx, Charles and Calmon, Flavio and Ustun, Berk},
  booktitle = {International Conference on Machine Learning},
  year      = {2020},
  pages     = {6765-6774},
  volume    = {119},
  url       = {https://mlanthology.org/icml/2020/marx2020icml-predictive/}
}