6DoF Head Pose Estimation Through Explicit Bidirectional Interaction with Face Geometry

Abstract

This study addresses the nuanced challenge of estimating head translations within the context of six-degrees-of-freedom (6DoF) head pose estimation, placing emphasis on this aspect over the more commonly studied head rotations. Identifying a gap in existing methodologies, we recognized the underutilized potential synergy between facial geometry and head translation. To bridge this gap, we propose a novel approach called the head Translation, Rotation, and face Geometry network (TRG), which stands out for its explicit bidirectional interaction structure. This structure has been carefully designed to leverage the complementary relationship between face geometry and head translation, marking a significant advancement in the field of head pose estimation. Our contributions also include the development of a strategy for estimating bounding box correction parameters and a technique for aligning landmarks to image. Both of these innovations demonstrate superior performance in 6DoF head pose estimation tasks. Extensive experiments conducted on ARKitFace and BIWI datasets confirm that the proposed method outperforms current state-of-the-art techniques. Codes are released at https://github. com/asw91666/TRG-Release.

Cite

Text

Chun and Chang. "6DoF Head Pose Estimation Through Explicit Bidirectional Interaction with Face Geometry." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73414-4_9

Markdown

[Chun and Chang. "6DoF Head Pose Estimation Through Explicit Bidirectional Interaction with Face Geometry." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/chun2024eccv-6dof/) doi:10.1007/978-3-031-73414-4_9

BibTeX

@inproceedings{chun2024eccv-6dof,
  title     = {{6DoF Head Pose Estimation Through Explicit Bidirectional Interaction with Face Geometry}},
  author    = {Chun, Sungho and Chang, Ju Yong},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73414-4_9},
  url       = {https://mlanthology.org/eccv/2024/chun2024eccv-6dof/}
}