For Robust Worst-Group Accuracy, Ignore Group Annotations

Abstract

Existing methods for last layer retraining that aim to optimize worst-group accuracy (WGA) rely heavily on well-annotated groups in the training data. We show, both in theory and practice, that annotation-based data augmentations using either downsampling or upweighting for WGA are susceptible to domain annotation noise. The WGA gap is exacerbated in high-noise regimes for models trained with vanilla empirical risk minimization (ERM). To this end, we introduce Regularized Annotation of Domains (RAD) to train robust last layer classifiers without needing explicit domain annotations. Our results show that RAD is competitive with other recently proposed domain annotation-free techniques. Most importantly, RAD outperforms state-of-the-art annotation-reliant methods even with only 5\% noise in the training data for several publicly available datasets.

Cite

Text

Stromberg et al. "For Robust Worst-Group Accuracy, Ignore Group Annotations." Transactions on Machine Learning Research, 2024.

Markdown

[Stromberg et al. "For Robust Worst-Group Accuracy, Ignore Group Annotations." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/stromberg2024tmlr-robust/)

BibTeX

@article{stromberg2024tmlr-robust,
  title     = {{For Robust Worst-Group Accuracy, Ignore Group Annotations}},
  author    = {Stromberg, Nathan and Ayyagari, Rohan and Welfert, Monica and Koyejo, Sanmi and Nock, Richard and Sankar, Lalitha},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/stromberg2024tmlr-robust/}
}