Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade
Abstract
We present a new approach to localize extensive facial landmarks with a coarse-to-fine convolutional network cascade. Deep convolutional neural networks (DCNN) have been successfully utilized in facial landmark localization for two-fold advantages: 1) geometric constraints among facial points are implicitly utilized, 2) huge amount of training data can be leveraged. However, in the task of extensive facial landmark localization, a large number of facial landmarks (more than 50 points) are required to be located in a unified system, which poses great difficulty in the structure design and training process of traditional convolutional networks. In this paper, we design a four-level convolutional network cascade, which tackles the problem in a coarse-to-fine manner. In our system, each network level is trained to locally refine a subset of facial landmarks generated by previous network levels. In addition, each level predicts explicit geometric constraints (the position and rotation angles of a specific facial component) to rectify the inputs of the current network level. The combination of coarse-to-fine cascade and geometric refinement enables our system to locate extensive facial landmarks (68 points) accurately in the 300-W facial landmark localization challenge.
Cite
Text
Zhou et al. "Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade." IEEE/CVF International Conference on Computer Vision Workshops, 2013. doi:10.1109/ICCVW.2013.58Markdown
[Zhou et al. "Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade." IEEE/CVF International Conference on Computer Vision Workshops, 2013.](https://mlanthology.org/iccvw/2013/zhou2013iccvw-extensive/) doi:10.1109/ICCVW.2013.58BibTeX
@inproceedings{zhou2013iccvw-extensive,
title = {{Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade}},
author = {Zhou, Erjin and Fan, Haoqiang and Cao, Zhimin and Jiang, Yuning and Yin, Qi},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2013},
pages = {386-391},
doi = {10.1109/ICCVW.2013.58},
url = {https://mlanthology.org/iccvw/2013/zhou2013iccvw-extensive/}
}