Self Supervised Learning Using Controlled Diffusion Image Augmentation

Goldfeder, Judah A; Puma, Patrick Minwan; Guo, Gabriel; Trigo, Gabriel Guerra; Lipson, Hod

NeurIPSW 2024

Abstract

Synthetic data generated by diffusion models has been shown to improve performance across various tasks, but existing approaches face two challenges: fine-tuning a diffusion model for a specific dataset is often expensive, and the domain gap between real and synthetic data limits the usefulness of synthetic data, especially in fine-grained classification settings. To mitigate these shortcomings, we developed CDaug, a novel approach to data augmentation that uses controlled diffusion. Instead of using diffusion models to generate wholly new images, we take a self-supervised approach and condition the generated images on existing data. This allows us to create high-quality synthetic augmentations that capture the semantic priors and underlying structure of the data while introducing meaningful, novel variations with no human intervention. Our pipeline uses ControlNet, conditioned on the original data, together with captions generated by the multi-modal LLM LLaVA2, to guide the generative process. Our work uses open-source models, does not require fine-tuning, and is modular. We demonstrate improved performance on 7 fine-grained datasets, in both few-shot and full-dataset settings, across many architectures.
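To make the data flow described in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes the pipeline can be viewed as three pluggable stages: a captioner (standing in for the multi-modal LLM), a control-signal extractor (standing in for whatever conditioning input ControlNet receives), and a conditioned generator (standing in for the diffusion sampler). All names here (`AugmentationPipeline`, `caption_fn`, `control_fn`, `generate_fn`, `build_training_set`, and the `toy_*` stubs) are hypothetical and chosen for illustration; the abstract does not specify the conditioning type, the number of variants per image, or how seeds are handled.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Sequence, Tuple

Sample = Tuple[Any, Any]  # (image, label)


@dataclass
class AugmentationPipeline:
    """Hypothetical wiring of caption -> control signal -> conditioned generation."""

    caption_fn: Callable[[Any], str]             # stand-in for a multi-modal captioner
    control_fn: Callable[[Any], Any]             # stand-in for control-signal extraction
    generate_fn: Callable[[str, Any, int], Any]  # stand-in for a controlled diffusion sampler

    def augment(self, image: Any, label: Any, n_variants: int = 2,
                base_seed: int = 0) -> List[Sample]:
        # Caption and control signal are both derived from the existing image,
        # so no human annotation is needed beyond the original label.
        prompt = self.caption_fn(image)
        control = self.control_fn(image)
        # Each variant uses a different seed; the original label is reused.
        return [(self.generate_fn(prompt, control, base_seed + i), label)
                for i in range(n_variants)]


def build_training_set(pipeline: AugmentationPipeline,
                       dataset: Sequence[Sample],
                       n_variants: int = 2) -> List[Sample]:
    """Return each real sample followed by its synthetic variants."""
    out: List[Sample] = []
    for idx, (image, label) in enumerate(dataset):
        out.append((image, label))
        out.extend(pipeline.augment(image, label, n_variants,
                                    base_seed=idx * n_variants))
    return out


# Toy stubs that operate on strings instead of pixels, so the sketch runs
# without any model weights. Real stages would call actual models here.
def toy_caption(image: str) -> str:
    return f"a photo of {image}"


def toy_control(image: str) -> str:
    return f"edges({image})"


def toy_generate(prompt: str, control: str, seed: int) -> str:
    return f"gen[{prompt}|{control}|{seed}]"
```

Because the stages are plain callables, any one of them could be swapped out without touching the others, which is one way to read the abstract's claim that the approach is modular. Replacing the toy stubs with real models is left open, since the abstract gives no configuration details.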

Cite

Text

Goldfeder et al. "Self Supervised Learning Using Controlled Diffusion Image Augmentation." NeurIPS 2024 Workshops: SSL, 2024.

Markdown

[Goldfeder et al. "Self Supervised Learning Using Controlled Diffusion Image Augmentation." NeurIPS 2024 Workshops: SSL, 2024.](https://mlanthology.org/neuripsw/2024/goldfeder2024neuripsw-self/)

BibTeX

@inproceedings{goldfeder2024neuripsw-self,
  title     = {{Self Supervised Learning Using Controlled Diffusion Image Augmentation}},
  author    = {Goldfeder, Judah A and Puma, Patrick Minwan and Guo, Gabriel and Trigo, Gabriel Guerra and Lipson, Hod},
  booktitle = {NeurIPS 2024 Workshops: SSL},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/goldfeder2024neuripsw-self/}
}