Spin: Diffusion-Based Semantic Image Painting Through Independent Information Injection
Abstract
Diffusion models have been utilized as powerful tools for various image editing tasks, including semantic image painting (SIP), which aims to generate content within masked regions conditioned on a reference image or text. SIP, especially those using images as conditions, often suffers from three issues: semantic inconsistency, unnatural transitions, and style inconsistency, which significantly hinder its practical application. To address these challenges, we propose a novel Semantic Image Painting framework with INdependent INformation INjection (Spin). Specifically, we compute a saliency map to segregate the reference image into salient and non-salient components. We then filter out the non-salient information during the semantic embedding extraction phase and precisely inject the semantic embedding into the masked region instead of the whole image during the semantic generation phase. Furthermore, we impose an additional style guidance to promote style consistency between background and foreground. Experimental results demonstrate that Spin achieve superior semantic similarity and image coherence across various styles, including realistic, pencil drawings, cartoon, and oil painting. Additionally, Spin offers diversity and editability, and can be integrated into other models that meet our prerequisites.
Cite
Text
Wu et al. "Spin: Diffusion-Based Semantic Image Painting Through Independent Information Injection." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I8.32901Markdown
[Wu et al. "Spin: Diffusion-Based Semantic Image Painting Through Independent Information Injection." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/wu2025aaai-spin/) doi:10.1609/AAAI.V39I8.32901BibTeX
@inproceedings{wu2025aaai-spin,
title = {{Spin: Diffusion-Based Semantic Image Painting Through Independent Information Injection}},
author = {Wu, Dantong and Chen, Zhiqiang and Du, Tianjiao and Ran, Peipei and Bai, Mengchao and Zhang, Kai},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {8351-8358},
doi = {10.1609/AAAI.V39I8.32901},
url = {https://mlanthology.org/aaai/2025/wu2025aaai-spin/}
}