Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models

Abstract

Image classifiers should be used with caution in the real world. Performance evaluated on a validation set may not reflect performance in the real world. In particular, classifiers may perform well for conditions that are frequently encountered during training, but poorly for other infrequent conditions. In this study, we hypothesize that recent advances in text-to-image generative models make them valuable for benchmarking computer vision models such as image classifiers: they can generate images conditioned by textual prompts that cause classifier failures, allowing failure conditions to be described with textual attributes. However, their generation cost becomes an issue when a large number of synthetic images need to be generated, which is the case when many different attribute combinations need to be tested. We propose an image classifier benchmarking method as an iterative process that alternates image generation, classifier evaluation, and attribute selection. This method efficiently explores the attributes that ultimately lead to poor behavior detection.

Cite

Text

Le-Coz et al. "Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00752

Markdown

[Le-Coz et al. "Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/lecoz2024cvprw-efficient/) doi:10.1109/CVPRW63382.2024.00752

BibTeX

@inproceedings{lecoz2024cvprw-efficient,
  title     = {{Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models}},
  author    = {Le-Coz, Adrien and Ouertatani, Houssem and Herbin, Stéphane and Adjed, Faouzi},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {7569-7578},
  doi       = {10.1109/CVPRW63382.2024.00752},
  url       = {https://mlanthology.org/cvprw/2024/lecoz2024cvprw-efficient/}
}