Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization

Abstract

Towards stable human pose estimation from monocular images, there remain two main dilemmas. On the one hand, the different perspectives, i.e., front view, side view, and top view, appear the inconsistent performances due to the depth ambiguity. On the other hand, foot posture plays a significant role in complicated human pose estimation, i.e., dance and sports, and foot-ground interaction, but unfortunately, it is omitted in most general approaches and datasets. In this paper, we first propose the Cross-View Fusion (CVF) module to catch up with better 3D intermediate representation and alleviate the view inconsistency based on the vision transformer encoder. Then the optimization-based method is introduced to reconstruct the foot pose and foot-ground contact for the general multi-view datasets including AIST++ and Human3.6M. Besides, the reversible kinematic topology strategy is innovated to utilize the contact information into the full-body with foot pose regressor. Extensive experiments on the popular benchmarks demonstrate that our method outperforms the state-of-the-art approaches by achieving 40.1mm PA-MPJPE on the 3DPW test set and 43.8mm on the AIST++ test set.

Cite

Text

Zhuo et al. "Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00070

Markdown

[Zhuo et al. "Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/zhuo2023cvpr-stable/) doi:10.1109/CVPR52729.2023.00070

BibTeX

@inproceedings{zhuo2023cvpr-stable,
  title     = {{Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization}},
  author    = {Zhuo, Li’an and Cao, Jian and Wang, Qi and Zhang, Bang and Bo, Liefeng},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {650-659},
  doi       = {10.1109/CVPR52729.2023.00070},
  url       = {https://mlanthology.org/cvpr/2023/zhuo2023cvpr-stable/}
}