Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection

Abstract

Despite remarkable progress of face analysis techniques, detecting landmarks on large-pose faces is still difficult due to self-occlusion, subtle landmark difference and incomplete information. To address these challenging issues, we introduce a novel recurrent 3D-2D dual learning model that alternatively performs 2D-based 3D face model refinement and 3D-to-2D projection based 2D landmark refinement to reliably reason about self-occluded landmarks, precisely capture the subtle landmark displacement and accurately detect landmarks even in presence of extremely large poses. The proposed model presents the first loop-closed learning framework that effectively exploits the informative feedback from the 3D-2D learning and its dual 2D-3D refinement tasks in a recurrent manner. Benefiting from these two mutual-boosting steps, our proposed model demonstrates appealing robustness to large poses (up to profile pose) and outstanding ability to capture fine-scale landmark displacement compared with existing 3D models. It achieves new state-of-the-art on the challenging AFLW benchmark. Moreover, our proposed model introduces a new architectural design that economically utilizes intermediate features and achieves 4x faster speed than its deep learning based counterparts.

Cite

Text

Xiao et al. "Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.181

Markdown

[Xiao et al. "Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/xiao2017iccv-recurrent/) doi:10.1109/ICCV.2017.181

BibTeX

@inproceedings{xiao2017iccv-recurrent,
  title     = {{Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection}},
  author    = {Xiao, Shengtao and Feng, Jiashi and Liu, Luoqi and Nie, Xuecheng and Wang, Wei and Yan, Shuicheng and Kassim, Ashraf},
  booktitle = {International Conference on Computer Vision},
  year      = {2017},
  doi       = {10.1109/ICCV.2017.181},
  url       = {https://mlanthology.org/iccv/2017/xiao2017iccv-recurrent/}
}