Facial Landmark Detection by Deep Multi-Task Learning
Abstract
Facial landmark detection has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we investigate the possibility of improving detection robustness through multi-task learning. Specifically, we wish to optimize facial landmark detection together with heterogeneous but subtly correlated tasks, e.g. head pose estimation and facial attribute inference. This is non-trivial since different tasks have different learning difficulties and convergence rates. To address this problem, we formulate a novel tasks-constrained deep model, with task-wise early stopping to facilitate learning convergence. Extensive evaluations show that the proposed task-constrained learning (i) outperforms existing methods, especially in dealing with faces with severe occlusion and pose variation, and (ii) reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model [21].
Cite
Text
Zhang et al. "Facial Landmark Detection by Deep Multi-Task Learning." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10599-4_7Markdown
[Zhang et al. "Facial Landmark Detection by Deep Multi-Task Learning." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/zhang2014eccv-facial/) doi:10.1007/978-3-319-10599-4_7BibTeX
@inproceedings{zhang2014eccv-facial,
title = {{Facial Landmark Detection by Deep Multi-Task Learning}},
author = {Zhang, Zhanpeng and Luo, Ping and Loy, Chen Change and Tang, Xiaoou},
booktitle = {European Conference on Computer Vision},
year = {2014},
pages = {94-108},
doi = {10.1007/978-3-319-10599-4_7},
url = {https://mlanthology.org/eccv/2014/zhang2014eccv-facial/}
}