DamoFD: Digging into Backbone Design on Face Detection

Abstract

Face detection (FD) has achieved remarkable success over the past few years, yet, these leaps often arrive when consuming enormous computation costs. Moreover, when considering a realistic situation, i.e., building a lightweight face detector under a computation-scarce scenario, such heavy computation cost limits the application of the face detector. To remedy this, several pioneering works design tiny face detectors through off-the-shelf neural architecture search (NAS) technologies, which are usually applied to the classification task. Thus, the searched architectures are sub-optimal for the face detection task since some design criteria between detection and classification task are different. As a representative, the face detection backbone design needs to guarantee the stage-level detection ability while it is not required for the classification backbone. Furthermore, the detection backbone consumes a vast body of inference budgets in the whole detection framework. Considering the intrinsic design requirement and the virtual importance role of the face detection backbone, we thus ask a critical question: How to employ NAS to search FD-friendly backbone architecture? To cope with this question, we propose a distribution-dependent stage-aware ranking score (DDSAR-Score) to explicitly characterize the stage-level expressivity and identify the individual importance of each stage, thus satisfying the aforementioned design criterion of the FD backbone. Based on our proposed DDSAR-Score, we conduct comprehensive experiments on the challenging Wider Face benchmark dataset and achieve dominant performance across a wide range of compute regimes. In particular, compared to the tiniest face detector SCRFD-0.5GF, our method is +2.5 % better in Average Precision (AP) score when using the same amount of FLOPs. The code is avaliable at https://github.com/ly19965/FaceMaas/tree/master/face_project/face_detection/DamoFD.

Cite

Text

Liu et al. "DamoFD: Digging into Backbone Design on Face Detection." International Conference on Learning Representations, 2023.

Markdown

[Liu et al. "DamoFD: Digging into Backbone Design on Face Detection." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/liu2023iclr-damofd/)

BibTeX

@inproceedings{liu2023iclr-damofd,
  title     = {{DamoFD: Digging into Backbone Design on Face Detection}},
  author    = {Liu, Yang and Deng, Jiankang and Wang, Fei and Shang, Lei and Xie, Xuansong and Sun, Baigui},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/liu2023iclr-damofd/}
}