Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox

Abstract

Most existing out-of-distribution (OOD) detection benchmarks classify samples with novel labels as the OOD data. However, some marginal OOD samples actually have close semantic contents to the in-distribution (ID) sample, which makes determining the OOD sample a Sorites Paradox. In this paper, we construct a benchmark named Incremental Shift OOD (IS-OOD) to address the issue, in which we divide the test samples into subsets with different semantic and covariate shift degrees relative to the ID dataset. The data division is achieved through a shift measuring method based on our proposed Language Aligned Image feature Decomposition (LAID). Moreover, we construct a Synthetic Incremental Shift (Syn-IS) dataset that contains high-quality generated images with more diverse covariate contents to complement the IS-OOD benchmark. We evaluate current OOD detection methods on our benchmark and find several important insights: (1) The performance of most OOD detection methods significantly improves as the semantic shift increases; (2) Some methods like GradNorm may have different OOD detection mechanisms as they rely less on semantic shifts to make decisions; (3) Excessive covariate shifts in the image are also likely to be considered as OOD for some methods. Our code and data are released in https://github.com/qqwsad5/IS-OOD.

Cite

Text

Long et al. "Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox." Neural Information Processing Systems, 2024. doi:10.52202/079017-2852

Markdown

[Long et al. "Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/long2024neurips-rethinking/) doi:10.52202/079017-2852

BibTeX

@inproceedings{long2024neurips-rethinking,
  title     = {{Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox}},
  author    = {Long, Xingming and Zhang, Jie and Shan, Shiguang and Chen, Xilin},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-2852},
  url       = {https://mlanthology.org/neurips/2024/long2024neurips-rethinking/}
}