Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

Abstract

In the field of Few-Shot Image Generation (FSIG) using Deep Generative Models (DGMs), accurately estimating the distribution of target domain with minimal samples poses a significant challenge. This requires a method that can both capture the broad diversity and the true characteristics of the target domain distribution. We present Conditional Relaxing Diffusion Inversion (CRDI), an innovative ‘training-free’ approach designed to enhance distribution diversity in synthetic image generation. Distinct from conventional methods, CRDI does not rely on fine-tuning based on only a few samples. Instead, it focuses on reconstructing each target image instance and expanding diversity through a few-shot learning. The approach initiates by identifying a Sample-wise Guidance Embedding (SGE) for the diffusion model, which serves a purpose analogous to the explicit latent codes in certain Generative Adversarial Network (GAN) models. Subsequently, the method involves a scheduler that progressively introduces perturbations to the SGE, thereby augmenting diversity. Comprehensive experiments demonstrate that our method outperforms GAN-based reconstruction techniques and achieves comparable performance to state-of-the-art (SOTA) FSIG methods. Additionally, it effectively mitigates overfitting and catastrophic forgetting, common drawbacks of fine-tuning approaches. Code is available at GitHub.

Cite

Text

Cao and Gong. "Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72907-2_2

Markdown

[Cao and Gong. "Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/cao2024eccv-fewshot/) doi:10.1007/978-3-031-72907-2_2

BibTeX

@inproceedings{cao2024eccv-fewshot,
  title     = {{Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion}},
  author    = {Cao, Yu and Gong, Shaogang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72907-2_2},
  url       = {https://mlanthology.org/eccv/2024/cao2024eccv-fewshot/}
}