IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

Abstract

Generative object compositing emerges as a promising new avenue for compositional image editing. However the requirement of object identity preservation poses a significant challenge limiting practical usage of most existing methods. In response this paper introduces IMPRINT a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity preservation from that of compositing. The first stage is targeted for context-agnostic identity-preserving pretraining of the object encoder enabling the encoder to learn an embedding that is both view-invariant and conducive to enhanced detail preservation. The subsequent stage leverages this representation to learn seamless harmonization of the object composited to the background. In addition IMPRINT incorporates a shape-guidance mechanism offering user-directed control over the compositing process. Extensive experiments demonstrate that IMPRINT significantly outperforms existing methods and various baselines on identity preservation and composition quality.

Cite

Text

Song et al. "IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00769

Markdown

[Song et al. "IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/song2024cvpr-imprint/) doi:10.1109/CVPR52733.2024.00769

BibTeX

@inproceedings{song2024cvpr-imprint,
  title     = {{IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation}},
  author    = {Song, Yizhi and Zhang, Zhifei and Lin, Zhe and Cohen, Scott and Price, Brian and Zhang, Jianming and Kim, Soo Ye and Zhang, He and Xiong, Wei and Aliaga, Daniel},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {8048-8058},
  doi       = {10.1109/CVPR52733.2024.00769},
  url       = {https://mlanthology.org/cvpr/2024/song2024cvpr-imprint/}
}