Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases

Abstract

The problem of spurious correlations (SCs) arises when a classifier relies on non-predictive features that happen to be correlated with the labels in the training data. Previous SC benchmark datasets suffer from varying issues, e.g., over-saturation or only containing one-to-one (O2O) SCs, but no many-to-many (M2M) SCs arising between groups of spurious attributes and classes. In this paper, we present Spawrious-\{O2O, M2M\}-\{Easy, Medium, Hard\}, an image classification benchmark suite containing spurious correlations between classes and backgrounds. We employ a text-to-image model to generate photo-realistic images and an image captioning model to filter out unsuitable ones. The resulting dataset is of high quality and contains approximately 152k images. Our experimental results demonstrate that state-of-the-art group robustness methods struggle with Spawrious.

Cite

Text

Lynch et al. "Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases." ICLR 2025 Workshops: SCSL, 2025.

Markdown

[Lynch et al. "Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases." ICLR 2025 Workshops: SCSL, 2025.](https://mlanthology.org/iclrw/2025/lynch2025iclrw-spawrious/)

BibTeX

@inproceedings{lynch2025iclrw-spawrious,
  title     = {{Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases}},
  author    = {Lynch, Aengus and Dovonon, Gbetondji Jean-Sebastien and Kaddour, Jean and Silva, Ricardo},
  booktitle = {ICLR 2025 Workshops: SCSL},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/lynch2025iclrw-spawrious/}
}