Beyond Textual Constraints: Learning Novel Diffusion Conditions with Fewer Examples
Abstract
In this paper we delve into a novel aspect of learning novel diffusion conditions with datasets an order of magnitude smaller. The rationale behind our approach is the elimination of textual constraints during the few-shot learning process. To that end we implement two optimization strategies. The first prompt-free conditional learning utilizes a prompt-free encoder derived from a pre-trained Stable Diffusion model. This strategy is designed to adapt new conditions to the diffusion process by minimizing the textual-visual correlation thereby ensuring a more precise alignment between the generated content and the specified conditions. The second strategy entails condition-specific negative rectification which addresses the inconsistencies typically brought about by Classifier-free guidance in few-shot training contexts. Our extensive experiments across a variety of condition modalities demonstrate the effectiveness and efficiency of our framework yielding results comparable to those obtained with datasets a thousand times larger. Our codes are available at https://github.com/Yuyan9Yu/BeyondTextConstraint.
Cite
Text
Yu et al. "Beyond Textual Constraints: Learning Novel Diffusion Conditions with Fewer Examples." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00679Markdown
[Yu et al. "Beyond Textual Constraints: Learning Novel Diffusion Conditions with Fewer Examples." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/yu2024cvpr-beyond/) doi:10.1109/CVPR52733.2024.00679BibTeX
@inproceedings{yu2024cvpr-beyond,
title = {{Beyond Textual Constraints: Learning Novel Diffusion Conditions with Fewer Examples}},
author = {Yu, Yuyang and Liu, Bangzhen and Zheng, Chenxi and Xu, Xuemiao and Zhang, Huaidong and He, Shengfeng},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {7109-7118},
doi = {10.1109/CVPR52733.2024.00679},
url = {https://mlanthology.org/cvpr/2024/yu2024cvpr-beyond/}
}