Feature Pyramid Networks for Object Detection

Lin, Tsung-Yi; Dollar, Piotr; Girshick, Ross; He, Kaiming; Hariharan, Bharath; Belongie, Serge

doi:10.1109/CVPR.2017.106

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, Piotr Dollar, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie

CVPR 2017

doi:10.1109/CVPR.2017.106 /cvpr/2017/lin2017cvpr-feature/

Abstract

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But pyramid representations have been avoided in recent object detectors that are based on deep convolutional networks, partially because they are slow to compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A top-down architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.

PDF CVPR Semantic Scholar

Cite

Text

Lin et al. "Feature Pyramid Networks for Object Detection." Conference on Computer Vision and Pattern Recognition, 2017. doi:10.1109/CVPR.2017.106

Markdown

[Lin et al. "Feature Pyramid Networks for Object Detection." Conference on Computer Vision and Pattern Recognition, 2017.](https://mlanthology.org/cvpr/2017/lin2017cvpr-feature/) doi:10.1109/CVPR.2017.106

BibTeX

@inproceedings{lin2017cvpr-feature,
  title     = {{Feature Pyramid Networks for Object Detection}},
  author    = {Lin, Tsung-Yi and Dollar, Piotr and Girshick, Ross and He, Kaiming and Hariharan, Bharath and Belongie, Serge},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2017},
  doi       = {10.1109/CVPR.2017.106},
  url       = {https://mlanthology.org/cvpr/2017/lin2017cvpr-feature/}
}