Facial Landmark Detection by Deep Multi-Task Learning

Abstract

Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model [21].

Cite

Text

Zhang et al. "Facial Landmark Detection by Deep Multi-Task Learning." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10599-4_7

Markdown

[Zhang et al. "Facial Landmark Detection by Deep Multi-Task Learning." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/zhang2014eccv-facial/) doi:10.1007/978-3-319-10599-4_7

BibTeX

@inproceedings{zhang2014eccv-facial,
  title     = {{Facial Landmark Detection by Deep Multi-Task Learning}},
  author    = {Zhang, Zhanpeng and Luo, Ping and Loy, Chen Change and Tang, Xiaoou},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {94-108},
  doi       = {10.1007/978-3-319-10599-4_7},
  url       = {https://mlanthology.org/eccv/2014/zhang2014eccv-facial/}
}