Inversion Circle Interpolation: Diffusion-Based Image Augmentation for Data-Scarce Classification

Abstract

Data Augmentation (DA), i.e., synthesizing faithful and diverse samples to expand the original training set, is a prevalent and effective strategy to improve the performance of various data-scarce tasks. With the powerful image generation ability, diffusion-based DA has shown strong performance gains on different image classification benchmarks. In this paper, we analyze today's diffusion-based DA methods, and argue that they cannot take account of both faithfulness and diversity, which are two critical keys for generating high-quality samples and boosting classification performance. To this end, we propose a novel Diffusion-based DA method: Diff-II. Specifically, it consists of three steps: 1) Category concepts learning: Learning concept embeddings for each category. 2) Inversion interpolation: Calculating the inversion for each image, and conducting circle interpolation for two randomly sampled inversions from the same category. 3) Two-stage denoising: Using different prompts to generate synthesized images in a coarse-to-fine manner. Extensive experiments on various data-scarce image classification tasks (e.g., few-shot, long-tailed, and out-of-distribution classification) have demonstrated its effectiveness over state-of-the-art diffusion-based DA methods.

Cite

Text

Wang and Chen. "Inversion Circle Interpolation: Diffusion-Based Image Augmentation for Data-Scarce Classification." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.02380

Markdown

[Wang and Chen. "Inversion Circle Interpolation: Diffusion-Based Image Augmentation for Data-Scarce Classification." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/wang2025cvpr-inversion/) doi:10.1109/CVPR52734.2025.02380

BibTeX

@inproceedings{wang2025cvpr-inversion,
  title     = {{Inversion Circle Interpolation: Diffusion-Based Image Augmentation for Data-Scarce Classification}},
  author    = {Wang, Yanghao and Chen, Long},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {25560-25569},
  doi       = {10.1109/CVPR52734.2025.02380},
  url       = {https://mlanthology.org/cvpr/2025/wang2025cvpr-inversion/}
}