AugFPN: Improving Multi-Scale Feature Learning for Object Detection

Guo, Chaoxu; Fan, Bin; Zhang, Qian; Xiang, Shiming; Pan, Chunhong

doi:10.1109/CVPR42600.2020.01261

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

Chaoxu Guo, Bin Fan, Qian Zhang, Shiming Xiang, Chunhong Pan

CVPR 2020

doi:10.1109/CVPR42600.2020.01261 /cvpr/2020/guo2020cvpr-augfpn/

Abstract

Current state-of-the-art detectors typically exploit feature pyramid to detect objects at different scales. Among them, FPN is one of the representative works that build a feature pyramid by multi-scale features summation. However, the design defects behind prevent the multi-scale features from being fully exploited. In this paper, we begin by first analyzing the design defects of feature pyramid in FPN, and then introduce a new feature pyramid architecture named AugFPN to address these problems. Specifically, AugFPN consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection. AugFPN narrows the semantic gaps between features of different scales before feature fusion through Consistent Supervision. In feature fusion, ratio-invariant context information is extracted by Residual Feature Augmentation to reduce the information loss of feature map at the highest pyramid level. Finally, Soft RoI Selection is employed to learn a better RoI feature adaptively after feature fusion. By replacing FPN with AugFPN in Faster R-CNN, our models achieve 2.3 and 1.6 points higher Average Precision (AP) when using ResNet50 and MobileNet-v2 as backbone respectively. Furthermore, AugFPN improves RetinaNet by 1.6 points AP and FCOS by 0.9 points AP when using ResNet50 as backbone. Codes are available on https://github.com/Gus-Guo/AugFPN.

PDF CVPR Semantic Scholar

Cite

Text

Guo et al. "AugFPN: Improving Multi-Scale Feature Learning for Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01261

Markdown

[Guo et al. "AugFPN: Improving Multi-Scale Feature Learning for Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/guo2020cvpr-augfpn/) doi:10.1109/CVPR42600.2020.01261

BibTeX

@inproceedings{guo2020cvpr-augfpn,
  title     = {{AugFPN: Improving Multi-Scale Feature Learning for Object Detection}},
  author    = {Guo, Chaoxu and Fan, Bin and Zhang, Qian and Xiang, Shiming and Pan, Chunhong},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.01261},
  url       = {https://mlanthology.org/cvpr/2020/guo2020cvpr-augfpn/}
}