Automatic Facial Landmark Labeling with Minimal Supervision

Abstract

Landmark labeling of training images is essential for many learning tasks in computer vision, such as object detection, tracking, and alignment. Image labeling is typically conducted manually, which is both labor-intensive and error-prone. To improve this process, this paper proposes a new approach to estimate a set of landmarks for a large image ensemble with only a small number of manually labeled images from the ensemble. Our approach, named semi-supervised least-squares congealing, aims to minimize an objective function defined on both labeled and unlabeled images. A shape model is learnt on-line to constrain the landmark configuration. We also employ a partitioning strategy to allow coarse-to-fine landmark estimation. Extensive experiments on facial images show that our approach can reliably and accurately label landmarks for a large image ensemble starting from a small number of manually labeled images, under various challenging scenarios.

Cite

Text

Tong et al. "Automatic Facial Landmark Labeling with Minimal Supervision." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206670

Markdown

[Tong et al. "Automatic Facial Landmark Labeling with Minimal Supervision." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/tong2009cvpr-automatic/) doi:10.1109/CVPR.2009.5206670

BibTeX

@inproceedings{tong2009cvpr-automatic,
  title     = {{Automatic Facial Landmark Labeling with Minimal Supervision}},
  author    = {Tong, Yan and Liu, Xiaoming and Wheeler, Frederick W. and Tu, Peter H.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2009},
  pages     = {2097-2104},
  doi       = {10.1109/CVPR.2009.5206670},
  url       = {https://mlanthology.org/cvpr/2009/tong2009cvpr-automatic/}
}