Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models
Abstract
Personalized image generation enables customized content creation based on the text-to-image diffusion models.However, existing personalization methods focus on fine-tuning generative models to learn to generate specific single individuals or concepts, such as an image of a specific Corgi, but are unable to generate data for multiple individuals or concepts with common characteristics, such as images of multiple different Corgis. In this work, we focus on personalizing a diffusion model to generated varied data usually containing multiple subjects, which has a more diverse and complex data distribution. Our basic assumption is that the varied data distribution is composed of the common features shared among all samples, as well as the reasonable variations within it. Accordingly, we are capable to decompose the learning process of complex data distributions into two simpler sub-tasks, employing a divide-and-conquer approach. To this end we propose Dis2Booth, a framework that can learn complex image Distribution by Disentangling data distribution in an unsupervised manner.Specifically, Dis2Booth contains two modules, Anchor LoRA and Delta LoRA, that are tasked with learning the common features and variational features constrained by Contextual Loss and Delta Loss unsupervisedly. Besides, the Asynchronous Optimization Strategy is proposed to ensure the collaborative training of the two modules. Extensive experiments suggest that Dis2Booth is able to learn the data distribution with higher diversity and complexity while maintaining the same level of flexibility as LoRA.
Cite
Text
Ding et al. "Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I3.32279Markdown
[Ding et al. "Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/ding2025aaai-dis/) doi:10.1609/AAAI.V39I3.32279BibTeX
@inproceedings{ding2025aaai-dis,
title = {{Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models}},
author = {Ding, Guanqi and Yang, Chengyu and Wang, Shuhui and Li, Xincheng and Zhang, Jinzhe and Jin, Xin and Huang, Qingming},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {2744-2752},
doi = {10.1609/AAAI.V39I3.32279},
url = {https://mlanthology.org/aaai/2025/ding2025aaai-dis/}
}