Visual Data Diagnosis and Debiasing with Concept Graphs

Abstract

The widespread success of deep learning models today is owed to the curation of extensive datasets significant in size and complexity. However, such models frequently pick up inherent biases in the data during the training process, leading to unreliable predictions. Diagnosing and debiasing datasets is thus a necessity to ensure reliable model performance. In this paper, we present ConBias, a novel framework for diagnosing and mitigating Concept co-occurrence Biases in visual datasets. ConBias represents visual datasets as knowledge graphs of concepts, enabling meticulous analysis of spurious concept co-occurrences to uncover concept imbalances across the whole dataset. Moreover, we show that by employing a novel clique-based concept balancing strategy, we can mitigate these imbalances, leading to enhanced performance on downstream tasks. Extensive experiments show that data augmentation based on a balanced concept distribution augmented by ConBias improves generalization performance across multiple datasets compared to state-of-the-art methods.

Cite

Text

Chakraborty et al. "Visual Data Diagnosis and Debiasing with Concept Graphs." Neural Information Processing Systems, 2024. doi:10.52202/079017-3376

Markdown

[Chakraborty et al. "Visual Data Diagnosis and Debiasing with Concept Graphs." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/chakraborty2024neurips-visual/) doi:10.52202/079017-3376

BibTeX

@inproceedings{chakraborty2024neurips-visual,
  title     = {{Visual Data Diagnosis and Debiasing with Concept Graphs}},
  author    = {Chakraborty, Rwiddhi and Wang, Yinong and Gao, Jialu and Zheng, Runkai and Zhang, Cheng and De la Torre, Fernando},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-3376},
  url       = {https://mlanthology.org/neurips/2024/chakraborty2024neurips-visual/}
}