Conformalized Credal Regions for Classification with Ambiguous Ground Truth

Abstract

An open question in Imprecise Probabilistic Machine Learning is how to empirically derive a credal region (i.e., a closed and convex family of probabilities on the output space) from the available data, without any prior knowledge or assumption. In classification problems, credal regions are a tool that is able to provide provable guarantees under realistic assumptions by characterizing the uncertainty about the distribution of the labels. Building on previous work, we show that credal regions can be directly constructed using conformal methods. This allows us to provide a novel extension of classical conformal prediction to problems with ambiguous ground truth, that is, when the exact labels for given inputs are not exactly known. The resulting construction enjoys desirable practical and theoretical properties: (i) conformal coverage guarantees, (ii) smaller prediction sets (compared to classical conformal prediction regions) and (iii) disentanglement of uncertainty sources (epistemic, aleatoric). We empirically verify our findings on both synthetic and real datasets.

Cite

Text

Caprio et al. "Conformalized Credal Regions for Classification with Ambiguous Ground Truth." Transactions on Machine Learning Research, 2025.

Markdown

[Caprio et al. "Conformalized Credal Regions for Classification with Ambiguous Ground Truth." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/caprio2025tmlr-conformalized/)

BibTeX

@article{caprio2025tmlr-conformalized,
  title     = {{Conformalized Credal Regions for Classification with Ambiguous Ground Truth}},
  author    = {Caprio, Michele and Stutz, David and Li, Shuo and Doucet, Arnaud},
  journal   = {Transactions on Machine Learning Research},
  year      = {2025},
  url       = {https://mlanthology.org/tmlr/2025/caprio2025tmlr-conformalized/}
}