Differentiable Rendering-Based Pose-Conditioned Human Image Generation

Abstract

Conditional human image generation, or generation of human images with specified pose based on one or more reference images, is an inherently ill-defined problem, as there can be multiple plausible appearance for parts that are occluded in the reference. Using multiple images can mitigate this problem while boosting the performance. In this work, we introduce a differentiable vertex and edge renderer for incorporating the pose information to realize human image generation conditioned on multiple reference images. The differentiable renderer has parameters that can be jointly optimized with other parts of the system to obtain better results by learning more meaningful shape representation of human pose. We evaluate our method on the Market-1501 and DeepFashion datasets and comparison with existing approaches validates the effectiveness of our approach.

Cite

Text

Horiuchi et al. "Differentiable Rendering-Based Pose-Conditioned Human Image Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021. doi:10.1109/CVPRW53098.2021.00437

Markdown

[Horiuchi et al. "Differentiable Rendering-Based Pose-Conditioned Human Image Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021.](https://mlanthology.org/cvprw/2021/horiuchi2021cvprw-differentiable/) doi:10.1109/CVPRW53098.2021.00437

BibTeX

@inproceedings{horiuchi2021cvprw-differentiable,
  title     = {{Differentiable Rendering-Based Pose-Conditioned Human Image Generation}},
  author    = {Horiuchi, Yusuke and Simo-Serra, Edgar and Iizuka, Satoshi and Ishikawa, Hiroshi},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2021},
  pages     = {3921-3925},
  doi       = {10.1109/CVPRW53098.2021.00437},
  url       = {https://mlanthology.org/cvprw/2021/horiuchi2021cvprw-differentiable/}
}