Self-Supervised Correlation Mining Network for Person Image Generation
Abstract
Person image generation aims to perform non-rigid deformation on source images, which generally requires unaligned data pairs for training. Recently, self-supervised methods express great prospects in this task by merging the disentangled representations for self-reconstruction. However, such methods fail to exploit the spatial correlation between the disentangled features. In this paper, we propose a Self-supervised Correlation Mining Network (SCM-Net) to rearrange the source images in the feature space, in which two collaborative modules are integrated, Decomposed Style Encoder (DSE) and Correlation Mining Module (CMM). Specifically, the DSE first creates unaligned pairs at the feature level. Then, the CMM establishes the spatial correlation field for feature rearrangement. Eventually, a translation module transforms the rearranged features to realistic results. Meanwhile, for improving the fidelity of cross-scale pose transformation, we propose a graph based Body Structure Retaining Loss (BSR Loss) to preserve reasonable body structures on half body to full body generation. Extensive experiments conducted on DeepFashion dataset demonstrate the superiority of our method compared with other supervised and unsupervised approaches. Furthermore, satisfactory results on face generation show the versatility of our method in other deformation tasks.
Cite
Text
Wang et al. "Self-Supervised Correlation Mining Network for Person Image Generation." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00755Markdown
[Wang et al. "Self-Supervised Correlation Mining Network for Person Image Generation." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/wang2022cvpr-selfsupervised/) doi:10.1109/CVPR52688.2022.00755BibTeX
@inproceedings{wang2022cvpr-selfsupervised,
title = {{Self-Supervised Correlation Mining Network for Person Image Generation}},
author = {Wang, Zijian and Qi, Xingqun and Yuan, Kun and Sun, Muyi},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {7703-7712},
doi = {10.1109/CVPR52688.2022.00755},
url = {https://mlanthology.org/cvpr/2022/wang2022cvpr-selfsupervised/}
}