IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
Abstract
Generative object compositing emerges as a promising new avenue for compositional image editing. However the requirement of object identity preservation poses a significant challenge limiting practical usage of most existing methods. In response this paper introduces IMPRINT a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity preservation from that of compositing. The first stage is targeted for context-agnostic identity-preserving pretraining of the object encoder enabling the encoder to learn an embedding that is both view-invariant and conducive to enhanced detail preservation. The subsequent stage leverages this representation to learn seamless harmonization of the object composited to the background. In addition IMPRINT incorporates a shape-guidance mechanism offering user-directed control over the compositing process. Extensive experiments demonstrate that IMPRINT significantly outperforms existing methods and various baselines on identity preservation and composition quality.
Cite
Text
Song et al. "IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00769Markdown
[Song et al. "IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/song2024cvpr-imprint/) doi:10.1109/CVPR52733.2024.00769BibTeX
@inproceedings{song2024cvpr-imprint,
title = {{IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation}},
author = {Song, Yizhi and Zhang, Zhifei and Lin, Zhe and Cohen, Scott and Price, Brian and Zhang, Jianming and Kim, Soo Ye and Zhang, He and Xiong, Wei and Aliaga, Daniel},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {8048-8058},
doi = {10.1109/CVPR52733.2024.00769},
url = {https://mlanthology.org/cvpr/2024/song2024cvpr-imprint/}
}