TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models

Abstract

Text-to-Image (TTI) generative models have shown great progress in the past few years in terms of their ability to generate complex and high-quality imagery. At the same time, these models have been shown to suffer from harmful biases, including exaggerated societal biases (e.g., gender, ethnicity), as well as incidental correlations that limit such a model’s ability to generate more diverse imagery. In this paper, we propose a general approach to study and quantify a broad spectrum of biases, for any TTI model and for any prompt, using counterfactual reasoning. Unlike other works that evaluate generated images on a predefined set of bias axes, our approach automatically identifies potential biases that might be relevant to the given prompt, and measures those biases. In addition, we complement quantitative scores with post-hoc explanations in terms of semantic concepts in the images generated. We show that our method is uniquely capable of explaining complex multi-dimensional biases through semantic concepts, as well as the intersectionality between different biases for any given prompt. We perform extensive user studies to illustrate that the results of our method and analysis are consistent with human judgements. Data and code are available at https://tibet-ai.github.io.

Cite

Text

Chinchure et al. "TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72986-7_25

Markdown

[Chinchure et al. "TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/chinchure2024eccv-tibet/) doi:10.1007/978-3-031-72986-7_25

BibTeX

@inproceedings{chinchure2024eccv-tibet,
  title     = {{TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models}},
  author    = {Chinchure, Aditya and Shukla, Pushkar and Bhatt, Gaurav and Salij, Kiri and Hosanagar, Kartik and Sigal, Leonid and Turk, Matthew},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72986-7_25},
  url       = {https://mlanthology.org/eccv/2024/chinchure2024eccv-tibet/}
}