DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation
Abstract
Reliably detecting when a deployed machine learning model is likely to fail on a given input is crucial for ensuring safe operation. In this work, we propose DECIDER (Debiasing Classifiers to Identify Errors Reliably), a novel approach that leverages priors from large language models (LLMs) and vision-language models (VLMs) to detect failures in image classification models. DECIDER utilizes LLMs to specify task-relevant core attributes and constructs a “debiased” version of the classifier by aligning its visual features to these core attributes using a VLM, and detects potential failure by measuring disagreement between the original and debiased models. In addition to proactively identifying samples on which the model would fail, DECIDER also provides human-interpretable explanations for failure through a novel attribute-ablation strategy. Through extensive experiments across diverse benchmarks spanning subpopulation shifts (spurious correlations, class imbalance) and covariate shifts (synthetic corruptions, domain shifts), DECIDER consistently achieves state-of-the-art failure detection performance, significantly outperforming baselines in terms of the overall Matthews correlation coefficient as well as failure and success recall. Our codes can be accessed at https://github.com/kowshikthopalli/DECIDER/
Cite
Text
Subramanyam et al. "DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72986-7_27Markdown
[Subramanyam et al. "DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/subramanyam2024eccv-decider/) doi:10.1007/978-3-031-72986-7_27BibTeX
@inproceedings{subramanyam2024eccv-decider,
title = {{DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation}},
author = {Subramanyam, Rakshith and Thopalli, Kowshik and Narayanaswamy, Vivek Sivaraman and Thiagarajan, Jayaraman J.},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72986-7_27},
url = {https://mlanthology.org/eccv/2024/subramanyam2024eccv-decider/}
}