Pointwise Sampling Uncertainties on the Precision-Recall Curve

Abstract

Quoting robust uncertainties on machine learning (ML) model metrics, such as f1-score, precision, recall, etc., from sources of uncertainty such as data sampling, parameter initialization, and target labelling, is typically not done in the field of data science, even though these are essential for the proper interpretation and comparison of ML models. This text shows how to calculate and visualize the impact of one dominant source of uncertainty - on each point of the Precision-Recall (PR) and Receiver Operating Characteristic (ROC) curves. This is particularly relevant for PR curves, where the joint uncertainty on recall and precision can be large and non-linear, especially at low recall. Four statistical methods to evaluate this uncertainty, both frequentist and Bayesian in origin, are compared in terms of coverage and speed. Of these, Wilks’ toolbox.

Cite

Text

Urlus et al. "Pointwise Sampling Uncertainties on the Precision-Recall Curve." Artificial Intelligence and Statistics, 2023.

Markdown

[Urlus et al. "Pointwise Sampling Uncertainties on the Precision-Recall Curve." Artificial Intelligence and Statistics, 2023.](https://mlanthology.org/aistats/2023/urlus2023aistats-pointwise/)

BibTeX

@inproceedings{urlus2023aistats-pointwise,
  title     = {{Pointwise Sampling Uncertainties on the Precision-Recall Curve}},
  author    = {Urlus, Ralph E.Q. and Baak, Max and Collot, Stéphane and Fridman Rojas, Ilan},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2023},
  pages     = {8211-8232},
  volume    = {206},
  url       = {https://mlanthology.org/aistats/2023/urlus2023aistats-pointwise/}
}