S3FD: Single Shot Scale-Invariant Face Detector
Abstract
This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces. Specifically, we try to solve the common problem that anchor-based detectors deteriorate dramatically as the objects become smaller. We make contributions in the following three aspects: 1) proposing a scale-equitable face detection framework to handle different scales of faces well. We tile anchors on a wide range of layers to ensure that all scales of faces have enough features for detection. Besides, we design anchor scales based on the effective receptive field and a proposed equal proportion interval principle; 2) improving the recall rate of small faces by a scale compensation anchor matching strategy; 3) reducing the false positive rate of small faces via a max-out background label. As a consequence, our method achieves state-of-the-art detection performance on all the common face detection benchmarks, including the AFW, PASCAL face, FDDB and WIDER FACE datasets, and can run at 36 FPS on a Nvidia Titan X (Pascal) for VGA-resolution images.
Cite
Text
Zhang et al. "S3FD: Single Shot Scale-Invariant Face Detector." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.30Markdown
[Zhang et al. "S3FD: Single Shot Scale-Invariant Face Detector." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/zhang2017iccv-s3fd/) doi:10.1109/ICCV.2017.30BibTeX
@inproceedings{zhang2017iccv-s3fd,
title = {{S3FD: Single Shot Scale-Invariant Face Detector}},
author = {Zhang, Shifeng and Zhu, Xiangyu and Lei, Zhen and Shi, Hailin and Wang, Xiaobo and Li, Stan Z.},
booktitle = {International Conference on Computer Vision},
year = {2017},
doi = {10.1109/ICCV.2017.30},
url = {https://mlanthology.org/iccv/2017/zhang2017iccv-s3fd/}
}