Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade

Zhou, Erjin; Fan, Haoqiang; Cao, Zhimin; Jiang, Yuning; Yin, Qi

doi:10.1109/ICCVW.2013.58

Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade

Erjin Zhou, Haoqiang Fan, Zhimin Cao, Yuning Jiang, Qi Yin

ICCVW 2013 pp. 386-391

doi:10.1109/ICCVW.2013.58 /iccvw/2013/zhou2013iccvw-extensive/

Abstract

We present a new approach to localize extensive facial landmarks with a coarse-to-fine convolutional network cascade. Deep convolutional neural networks (DCNN) have been successfully utilized in facial landmark localization for two-fold advantages: 1) geometric constraints among facial points are implicitly utilized, 2) huge amount of training data can be leveraged. However, in the task of extensive facial landmark localization, a large number of facial landmarks (more than 50 points) are required to be located in a unified system, which poses great difficulty in the structure design and training process of traditional convolutional networks. In this paper, we design a four-level convolutional network cascade, which tackles the problem in a coarse-to-fine manner. In our system, each network level is trained to locally refine a subset of facial landmarks generated by previous network levels. In addition, each level predicts explicit geometric constraints (the position and rotation angles of a specific facial component) to rectify the inputs of the current network level. The combination of coarse-to-fine cascade and geometric refinement enables our system to locate extensive facial landmarks (68 points) accurately in the 300-W facial landmark localization challenge.

ICCVW Semantic Scholar

Cite

Text

Zhou et al. "Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade." IEEE/CVF International Conference on Computer Vision Workshops, 2013. doi:10.1109/ICCVW.2013.58

Markdown

[Zhou et al. "Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade." IEEE/CVF International Conference on Computer Vision Workshops, 2013.](https://mlanthology.org/iccvw/2013/zhou2013iccvw-extensive/) doi:10.1109/ICCVW.2013.58

BibTeX

@inproceedings{zhou2013iccvw-extensive,
  title     = {{Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade}},
  author    = {Zhou, Erjin and Fan, Haoqiang and Cao, Zhimin and Jiang, Yuning and Yin, Qi},
  booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
  year      = {2013},
  pages     = {386-391},
  doi       = {10.1109/ICCVW.2013.58},
  url       = {https://mlanthology.org/iccvw/2013/zhou2013iccvw-extensive/}
}