Style Adaptive Semantic Image Editing with Transformers

Abstract

The goal of semantic image editing is to modify an image based on an input semantic label map, to carry out the necessary image manipulation. Existing approaches typically lack control over the style of the editing, resulting in insufficient flexibility to support the desired level of customization, e.g. , to turn an object into a particular style or to pick a specific instance. In this work, we propose Style Adaptive Semantic Image Editing (SASIE), where a reference image is used as an additional input about style, to guide the image manipulation process in a more adaptive manner. Moreover, we propose a new transformer-based architecture for SASIE, in which intra-/inter-image multi-head self-attention blocks transfer intra-/inter-knowledge. The content of the edited areas is synthesized according to the given semantic label, while the style of the edited areas is inherited from the reference image. Extensive experiments on multiple datasets suggest that our method is highly effective and enables customizable image manipulation.

Cite

Text

Günther et al. "Style Adaptive Semantic Image Editing with Transformers." European Conference on Computer Vision Workshops, 2022. doi:10.1007/978-3-031-25063-7_12

Markdown

[Günther et al. "Style Adaptive Semantic Image Editing with Transformers." European Conference on Computer Vision Workshops, 2022.](https://mlanthology.org/eccvw/2022/gunther2022eccvw-style/) doi:10.1007/978-3-031-25063-7_12

BibTeX

@inproceedings{gunther2022eccvw-style,
  title     = {{Style Adaptive Semantic Image Editing with Transformers}},
  author    = {Günther, Edward and Gong, Rui and Van Gool, Luc},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2022},
  pages     = {187-203},
  doi       = {10.1007/978-3-031-25063-7_12},
  url       = {https://mlanthology.org/eccvw/2022/gunther2022eccvw-style/}
}