Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images

Abstract

This paper studies the task of estimating the 3D human poses of multiple persons from multiple calibrated camera views. Following the top-down paradigm, we decompose the task into two stages, i.e. person localization and pose estimation. Both stages are processed in coarse-to-fine manners. And we propose three task-specific graph neural networks for effective message passing. For 3D person localization, we first use Multi-view Matching Graph Module (MMG) to learn the cross-view association and recover coarse human proposals. The Center Refinement Graph Module (CRG) further refines the results via flexible point-based prediction. For 3D pose estimation, the Pose Regression Graph Module (PRG) learns both the multi-view geometry and structural relations between human joints. Our approach achieves state-of-the-art performance on CMU Panoptic and Shelf datasets with significantly lower computation complexity.

Cite

Text

Wu et al. "Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.01096

Markdown

[Wu et al. "Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/wu2021iccv-graphbased/) doi:10.1109/ICCV48922.2021.01096

BibTeX

@inproceedings{wu2021iccv-graphbased,
  title     = {{Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images}},
  author    = {Wu, Size and Jin, Sheng and Liu, Wentao and Bai, Lei and Qian, Chen and Liu, Dong and Ouyang, Wanli},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {11148-11157},
  doi       = {10.1109/ICCV48922.2021.01096},
  url       = {https://mlanthology.org/iccv/2021/wu2021iccv-graphbased/}
}