Scene-Aware Generative Network for Human Motion Synthesis
Abstract
We revisit human motion synthesis, a task useful in various real-world applications, in this paper. Whereas a number of methods have been developed previously for this task, they are often limited in two aspects: 1) focus on the poses while leaving the location movement behind, and 2) ignore the impact of the environment on the human motion. In this paper, we propose a new framework, with the interaction between the scene and the human motion is taken into account. Considering the uncertainty of human motion, we formulate this task as a generative task, whose objective is to generate plausible human motion conditioned on both the scene and the human's initial position. This framework factorizes the distribution of human motions into a distribution of movement trajectories conditioned on scenes and that of body pose dynamics conditioned on both scenes and trajectories. We further derive a GAN-based learning approach, with discriminators to enforce the compatibility between the human motion and the contextual scene as well as the 3D-to-2D projection constraints. We assess the effectiveness of the proposed method on two challenging datasets, which cover both synthetic and real-world environmentemphasizes local structural constraints via depth-map crops, and a projection discriminator that emphasizes global structural constraints via 3D-to-2D motion projections. The effectiveness of our framework is comprehensively evaluated on two large challenging datasets, covering both a synthetic environment (GTA-IM) and a real environment (PROX)
Cite
Text
Wang et al. "Scene-Aware Generative Network for Human Motion Synthesis." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.01203Markdown
[Wang et al. "Scene-Aware Generative Network for Human Motion Synthesis." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/wang2021cvpr-sceneaware/) doi:10.1109/CVPR46437.2021.01203BibTeX
@inproceedings{wang2021cvpr-sceneaware,
title = {{Scene-Aware Generative Network for Human Motion Synthesis}},
author = {Wang, Jingbo and Yan, Sijie and Dai, Bo and Lin, Dahua},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2021},
pages = {12206-12215},
doi = {10.1109/CVPR46437.2021.01203},
url = {https://mlanthology.org/cvpr/2021/wang2021cvpr-sceneaware/}
}