Global Adaptation Meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation

Abstract

When applying a pre-trained 2D-to-3D Human Pose lifting model to a target unseen dataset, a large performance degradation is commonly encountered due to domain shift issues. We observe that the degradation is caused by two factors: 1) the large distribution gap over global positions of poses between the source and target datasets due to variant camera parameters and settings, and 2) the deficient diversity of local structures of poses in the training. To this end, we combine global adaptation and local generalization in PoseDA, a simple yet effective framework of unsupervised domain adaptation for 3D human pose estimation. Specifically, global adaptation aims to align global positions of poses from source domain to target domain with a proposed global position alignment (GPA) module. This module brings significant performance improvement without introducing additional learnable parameters. In addition, we propose local pose augmentation (LPA) to enhance the diversity of 3D poses following an adversarial training scheme consisting of 1) a augmentation generator that generates the parameters of pre-defined pose transformations and 2) an anchor discriminator to ensure the reality and quality of the augmented data. Our approach can be applicable to almost all 2D-3D lifting models. PoseDA achieves 61.3 mm of MPJPE on MPI-INF-3DHP under a cross-dataset evaluation setup, improving upon the previous state-of-the-art method by 10.2%.

Cite

Text

Chai et al. "Global Adaptation Meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01347

Markdown

[Chai et al. "Global Adaptation Meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/chai2023iccv-global/) doi:10.1109/ICCV51070.2023.01347

BibTeX

@inproceedings{chai2023iccv-global,
  title     = {{Global Adaptation Meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation}},
  author    = {Chai, Wenhao and Jiang, Zhongyu and Hwang, Jenq-Neng and Wang, Gaoang},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {14655-14665},
  doi       = {10.1109/ICCV51070.2023.01347},
  url       = {https://mlanthology.org/iccv/2023/chai2023iccv-global/}
}