High-Resolution Image Harmonization via Collaborative Dual Transformations

Abstract

Given a composite image, image harmonization aims to adjust the foreground to make it compatible with the background. High-resolution image harmonization is in high demand, but still remains unexplored. Conventional image harmonization methods learn global RGB-to-RGB transformation which could effortlessly scale to high resolution, but ignore diverse local context. Recent deep learning methods learn the dense pixel-to-pixel transformation which could generate harmonious outputs, but are highly constrained in low resolution. In this work, we propose a high-resolution image harmonization network with Collaborative Dual Transformation (CDTNet) to combine pixel-to-pixel transformation and RGB-to-RGB transformation coherently in an end-to-end network. Our CDTNet consists of a low-resolution generator for pixel-to-pixel transformation, a color mapping module for RGB-to-RGB transformation, and a refinement module to take advantage of both. Extensive experiments on high-resolution benchmark dataset and our created high-resolution real composite images demonstrate that our CDTNet strikes a good balance between efficiency and effectiveness. Our used datasets can be found in https://github.com/bcmi/CDTNet-High-Resolution-Image-Harmonization.

Cite

Text

Cong et al. "High-Resolution Image Harmonization via Collaborative Dual Transformations." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.01792

Markdown

[Cong et al. "High-Resolution Image Harmonization via Collaborative Dual Transformations." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/cong2022cvpr-highresolution/) doi:10.1109/CVPR52688.2022.01792

BibTeX

@inproceedings{cong2022cvpr-highresolution,
  title     = {{High-Resolution Image Harmonization via Collaborative Dual Transformations}},
  author    = {Cong, Wenyan and Tao, Xinhao and Niu, Li and Liang, Jing and Gao, Xuesong and Sun, Qihao and Zhang, Liqing},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {18470-18479},
  doi       = {10.1109/CVPR52688.2022.01792},
  url       = {https://mlanthology.org/cvpr/2022/cong2022cvpr-highresolution/}
}