Split Conformal Prediction and Non-Exchangeable Data

Abstract

Split conformal prediction (CP) is arguably the most popular CP method for uncertainty quantification, enjoying both academic interest and widespread deployment. However, the original theoretical analysis of split CP makes the crucial assumption of data exchangeability, which hinders many real-world applications. In this paper, we present a novel theoretical framework based on concentration inequalities and decoupling properties of the data, proving that split CP remains valid for many non-exchangeable processes by adding a small coverage penalty. Through experiments with both real and synthetic data, we show that our theoretical results translate to good empirical performance under non-exchangeability, e.g., for time series and spatiotemporal data. Compared to recent conformal algorithms designed to counter specific exchangeability violations, we show that split CP is competitive in terms of coverage and interval size, with the benefit of being extremely simple and orders of magnitude faster than alternatives.

Cite

Text

Oliveira et al. "Split Conformal Prediction and Non-Exchangeable Data." Journal of Machine Learning Research, 2024.

Markdown

[Oliveira et al. "Split Conformal Prediction and Non-Exchangeable Data." Journal of Machine Learning Research, 2024.](https://mlanthology.org/jmlr/2024/oliveira2024jmlr-split/)

BibTeX

@article{oliveira2024jmlr-split,
  title     = {{Split Conformal Prediction and Non-Exchangeable Data}},
  author    = {Oliveira, Roberto I. and Orenstein, Paulo and Ramos, Thiago and Romano, João Vitor},
  journal   = {Journal of Machine Learning Research},
  year      = {2024},
  pages     = {1-38},
  volume    = {25},
  url       = {https://mlanthology.org/jmlr/2024/oliveira2024jmlr-split/}
}