Minority-Focused Text-to-Image Generation via Prompt Optimization

Abstract

We investigate the generation of minority samples using pretrained text-to-image (T2I) latent diffusion models. Minority instances, in the context of T2I generation, can be defined as ones living on low-density regions of text-conditional data distributions. They are valuable for various applications of modern T2I generators, such as data augmentation and creative AI. Unfortunately, existing pretrained T2I diffusion models primarily focus on high-density regions, largely due to the influence of guided samplers (like CFG) that are essential for high-quality generation. To address this, we present a novel framework to counter the high-density-focus of T2I diffusion models. Specifically, we first develop an online prompt optimization framework that encourages emergence of desired properties during inference while preserving semantic contents of user-provided prompts. We subsequently tailor this generic prompt optimizer into a specialized solver that promotes generation of minority features by incorporating a carefully-crafted likelihood objective. Extensive experiments conducted across various types of T2I models demonstrate that our approach significantly enhances the capability to produce high-quality minority instances compared to existing samplers. Code is available at https://github.com/soobin-um/MinorityPrompt.

Cite

Text

Um and Ye. "Minority-Focused Text-to-Image Generation via Prompt Optimization." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01949

Markdown

[Um and Ye. "Minority-Focused Text-to-Image Generation via Prompt Optimization." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/um2025cvpr-minorityfocused/) doi:10.1109/CVPR52734.2025.01949

BibTeX

@inproceedings{um2025cvpr-minorityfocused,
  title     = {{Minority-Focused Text-to-Image Generation via Prompt Optimization}},
  author    = {Um, Soobin and Ye, Jong Chul},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {20926-20936},
  doi       = {10.1109/CVPR52734.2025.01949},
  url       = {https://mlanthology.org/cvpr/2025/um2025cvpr-minorityfocused/}
}