S3FD: Single Shot Scale-Invariant Face Detector

Abstract

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces. Specifically, we try to solve the common problem that anchor-based detectors deteriorate dramatically as the objects become smaller. We make contributions in the following three aspects: 1) proposing a scale-equitable face detection framework to handle different scales of faces well. We tile anchors on a wide range of layers to ensure that all scales of faces have enough features for detection. Besides, we design anchor scales based on the effective receptive field and a proposed equal proportion interval principle; 2) improving the recall rate of small faces by a scale compensation anchor matching strategy; 3) reducing the false positive rate of small faces via a max-out background label. As a consequence, our method achieves state-of-the-art detection performance on all the common face detection benchmarks, including the AFW, PASCAL face, FDDB and WIDER FACE datasets, and can run at 36 FPS on a Nvidia Titan X (Pascal) for VGA-resolution images.

Cite

Text

Zhang et al. "S3FD: Single Shot Scale-Invariant Face Detector." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.30

Markdown

[Zhang et al. "S3FD: Single Shot Scale-Invariant Face Detector." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/zhang2017iccv-s3fd/) doi:10.1109/ICCV.2017.30

BibTeX

@inproceedings{zhang2017iccv-s3fd,
  title     = {{S3FD: Single Shot Scale-Invariant Face Detector}},
  author    = {Zhang, Shifeng and Zhu, Xiangyu and Lei, Zhen and Shi, Hailin and Wang, Xiaobo and Li, Stan Z.},
  booktitle = {International Conference on Computer Vision},
  year      = {2017},
  doi       = {10.1109/ICCV.2017.30},
  url       = {https://mlanthology.org/iccv/2017/zhang2017iccv-s3fd/}
}