Label Augmentation as Inter-Class Data Augmentation for Conditional Image Synthesis with Imbalanced Data

Abstract

Conditional image synthesis performs admirably when trained on well-constructed and balanced datasets. However, in practice, training datasets frequently contain minorities (i.e., a class with a few samples), known as imbalanced data, which causes difficulties in learning generative models. To address conditional image synthesis with imbalanced data, we analyze a diversity issue of label-preserving data augmentation and an affinity issue of non-label-preserving data augmentation. From this observation, we present label augmentation, which works as inter-class data augmentation that effectively augments data by predicting a new label for a given image using the prediction of a pretrained image classification model (i.e., probabilities for each class). We incorporate our label augmentation into the discriminator of a seminal conditional generative adversarial network (GAN) model, proposing Softlabel-GAN. Using class probabilities extracts class-invariant and shared features between similar classes, achieving data augmentation with high affinity and diversity. Our experiments on imbalanced datasets show that Softlabel-GAN produces images with high quality and diversity while being hardly affected by the number of samples in each class. Code: https://github.com/raven38/softlabel-gan.

Cite

Text

Katsumata et al. "Label Augmentation as Inter-Class Data Augmentation for Conditional Image Synthesis with Imbalanced Data." Winter Conference on Applications of Computer Vision, 2024.

Markdown

[Katsumata et al. "Label Augmentation as Inter-Class Data Augmentation for Conditional Image Synthesis with Imbalanced Data." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/katsumata2024wacv-label/)

BibTeX

@inproceedings{katsumata2024wacv-label,
  title     = {{Label Augmentation as Inter-Class Data Augmentation for Conditional Image Synthesis with Imbalanced Data}},
  author    = {Katsumata, Kai and Vo, Duc Minh and Nakayama, Hideki},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2024},
  pages     = {4944-4953},
  url       = {https://mlanthology.org/wacv/2024/katsumata2024wacv-label/}
}