TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection

Abstract

Out-of-distribution (OOD) detection is crucial in many real-world applications. However, intelligent models are often trained solely on in-distribution (ID) data, leading to overconfidence when misclassifying OOD data as ID classes. In this study, we propose a new learning framework which leverage simple Jigsaw-based fake OOD data and rich semantic embeddings (`anchors') from the ChatGPT description of ID knowledge to help guide the training of the image encoder. The learning framework can be flexibly combined with existing post-hoc approaches to OOD detection, and extensive empirical evaluations on multiple OOD detection benchmarks demonstrate that rich textual representation of ID knowledge and fake OOD knowledge can well help train a visual encoder for OOD detection. With the learning framework, new state-of-the-art performance was achieved on all the benchmarks. The code is available at https://github.com/Cverchen/TagFog.

Cite

Text

Chen et al. "TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I2.27871

Markdown

[Chen et al. "TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/chen2024aaai-tagfog/) doi:10.1609/AAAI.V38I2.27871

BibTeX

@inproceedings{chen2024aaai-tagfog,
  title     = {{TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection}},
  author    = {Chen, Jiankang and Zhang, Tong and Zheng, Wei-Shi and Wang, Ruixuan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {1100-1109},
  doi       = {10.1609/AAAI.V38I2.27871},
  url       = {https://mlanthology.org/aaai/2024/chen2024aaai-tagfog/}
}