Human Motion Transfer from Poses in the Wild

Abstract

In this paper, we tackle the problem of human motion transfer, where we synthesize novel motion video for a target person that imitates the movement from a reference video. It is a video-to-video translation task in which the estimated poses are used to bridge two domains. Despite substantial progress on the topic, there exist several problems with the previous methods. First, there is a domain gap between training and testing pose sequences--the model is tested on poses it has not seen during training, such as difficult dancing moves. Furthermore, pose detection errors are inevitable, making the job of the generator harder. Finally, generating realistic pixels from sparse poses is challenging in a single step. To address these challenges, we introduce a novel pose-to-video translation framework for generating high-quality videos that are temporally coherent even for in-the-wild pose sequences unseen during training. We propose a pose augmentation method to minimize the training-test gap, a unified paired and unpaired learning strategy to improve the robustness to detection errors, and two-stage network architecture to achieve superior texture quality. To further boost research on the topic, we build two human motion datasets. Finally, we show the superiority of our approach over the state-of-the-art studies through extensive experiments and evaluations on different datasets.

Cite

Text

Ren et al. "Human Motion Transfer from Poses in the Wild." European Conference on Computer Vision Workshops, 2020. doi:10.1007/978-3-030-67070-2_16

Markdown

[Ren et al. "Human Motion Transfer from Poses in the Wild." European Conference on Computer Vision Workshops, 2020.](https://mlanthology.org/eccvw/2020/ren2020eccvw-human/) doi:10.1007/978-3-030-67070-2_16

BibTeX

@inproceedings{ren2020eccvw-human,
  title     = {{Human Motion Transfer from Poses in the Wild}},
  author    = {Ren, Jian and Chai, Menglei and Tulyakov, Sergey and Fang, Chen and Shen, Xiaohui and Yang, Jianchao},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2020},
  pages     = {262-279},
  doi       = {10.1007/978-3-030-67070-2_16},
  url       = {https://mlanthology.org/eccvw/2020/ren2020eccvw-human/}
}