Confidence Intervals and Simultaneous Confidence Bands Based on Deep Learning
Abstract
Deep learning models have significantly improved prediction accuracy in various fields, gaining recognition across numerous disciplines. Yet, an aspect of deep learning that remains insufficiently addressed is the assessment of prediction uncertainty. Producing reliable uncertainty estimators could be crucial in practical terms. For instance, predictions associated with a high degree of uncertainty could be sent for further evaluation. Recent works in uncertainty quantification of deep learning predictions, including Bayesian posterior credible intervals and a frequentist confidence-interval estimation, have proven to yield either invalid or overly conservative intervals. Furthermore, there is currently no method for quantifying uncertainty that can accommodate deep neural networks for survival (time-to-event) data that involves right-censored outcomes. In this work, we provide a non-parametric bootstrap method that disentangles data uncertainty from the noise inherent in the adopted optimization algorithm. %, ensuring that based on deep learning estimators with small bias, the resulting point-wise confidence intervals or the simultaneous confidence bands are accurate (i.e., valid and not overly conservative). The validity of the proposed approach is demonstrated through an extensive simulation study, which shows that the method is accurate (i.e., valid and not overly conservative) as long as the network is sufficiently deep to ensure that the estimators provided by the deep neural network exhibit minimal bias. Otherwise, undercoverage of up to 8\% is observed. The proposed ad-hoc method can be easily integrated into any deep neural network without interfering with the training process. The utility of the proposed approach is demonstrated through two applications: constructing simultaneous confidence bands for survival curves generated by deep neural networks dealing with right-censored survival data, and constructing a confidence interval for classification probabilities in the context of binary classification regression. Code for the data analysis and reported simulation is available at Githubsite: \url{https://github.com/Asafba123/Survival_bootstrap}.
Cite
Text
Arie and Gorfine. "Confidence Intervals and Simultaneous Confidence Bands Based on Deep Learning." Transactions on Machine Learning Research, 2024.Markdown
[Arie and Gorfine. "Confidence Intervals and Simultaneous Confidence Bands Based on Deep Learning." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/arie2024tmlr-confidence/)BibTeX
@article{arie2024tmlr-confidence,
title = {{Confidence Intervals and Simultaneous Confidence Bands Based on Deep Learning}},
author = {Arie, Asaf Ben and Gorfine, Malka},
journal = {Transactions on Machine Learning Research},
year = {2024},
url = {https://mlanthology.org/tmlr/2024/arie2024tmlr-confidence/}
}