NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection

Abstract

Due to the advantages of real-time detection and improved performance, single-shot detectors have gained great attention recently. To solve the complex scale variations, single-shot detectors make scale-aware predictions based on multiple pyramid layers. However, the features in the pyramid are not scale-aware enough, which limits the detection performance. Two common problems in single-shot detectors caused by object scale variations can be observed: (1) small objects are easily missed; (2) the salient part of a large object is sometimes detected as an object. With this observation, we propose a new Neighbor Erasing and Transferring (NET) mechanism to reconfigure the pyramid features and explore scale-aware features. In NET, a Neighbor Erasing Module (NEM) is designed to erase the salient features of large objects and emphasize the features of small objects in shallow layers. A Neighbor Transferring Module (NTM) is introduced to transfer the erased features and highlight large objects in deep layers. With this mechanism, a single-shot network called NETNet is constructed for scale-aware object detection. In addition, we propose to aggregate nearest neighboring pyramid features to enhance our NET. NETNet achieves 38.5% AP at a speed of 27 FPS and 32.0% AP at a speed of 55 FPS on MS COCO dataset. As a result, NETNet achieves a better trade-off for real-time and accurate object detection.

Cite

Text

Li et al. "NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01336

Markdown

[Li et al. "NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/li2020cvpr-netnet/) doi:10.1109/CVPR42600.2020.01336

BibTeX

@inproceedings{li2020cvpr-netnet,
  title     = {{NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection}},
  author    = {Li, Yazhao and Pang, Yanwei and Shen, Jianbing and Cao, Jiale and Shao, Ling},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.01336},
  url       = {https://mlanthology.org/cvpr/2020/li2020cvpr-netnet/}
}