DesignEdit: Unify Spatial-Aware Image Editing via Training-Free Inpainting with a Multi-Layered Latent Diffusion Framework

Abstract

Spatial-aware image editing focuses on modifying the position and size of elements within a given image. However, previous works still struggle with maintaining background harmony in the original editing areas, as well as preserving the initial identity of the edited elements, making it difficult to achieve complex multi-object editing in a single pass. In this paper, we aim to perform flexible spatial editing in a simple yet straightforward manner. We propose to inpaint the background first and develop a two-stage multi-layered latent diffusion framework to edit each element independently. Specifically, we design a key-masking self-attention scheme alongside artifact suppression to achieve background inpainting within the denoising process, leveraging the powerful generative capabilities of the Latent Diffusion Model, Stable Diffusion XL-1.0. The latent decomposition and fusion framework is capable of unifying various spatial-aware operations, including removal, resizing, relocation, flipping, addition, camera panning, zooming out, occlusion-aware editing, and cross-image editing. Experiments demonstrate the superior inpainting quality for object removal, along with enhanced versatility and higher precision in spatial-aware editing achieved by our method.

Cite

Text

Jia et al. "DesignEdit: Unify Spatial-Aware Image Editing via Training-Free Inpainting with a Multi-Layered Latent Diffusion Framework." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I4.32414

Markdown

[Jia et al. "DesignEdit: Unify Spatial-Aware Image Editing via Training-Free Inpainting with a Multi-Layered Latent Diffusion Framework." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/jia2025aaai-designedit/) doi:10.1609/AAAI.V39I4.32414

BibTeX

@inproceedings{jia2025aaai-designedit,
  title     = {{DesignEdit: Unify Spatial-Aware Image Editing via Training-Free Inpainting with a Multi-Layered Latent Diffusion Framework}},
  author    = {Jia, Yueru and Cheng, Aosong and Yuan, Yuhui and Wang, Chuke and Li, Ji and Jia, Huizhu and Zhang, Shanghang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {3958-3966},
  doi       = {10.1609/AAAI.V39I4.32414},
  url       = {https://mlanthology.org/aaai/2025/jia2025aaai-designedit/}
}