Enriched Feature Guided Refinement Network for Object Detection
Abstract
We propose a single-stage detection framework that jointly tackles multi-scale object detection and class imbalance. Rather than designing deeper networks, we introduce a simple yet effective feature enrichment scheme to produce multi-scale contextual features. We further introduce a cascaded refinement scheme with two steps. First, it instills the multi-scale contextual features into the prediction layers of the single-stage detector to enrich their discriminative power for multi-scale detection. Second, it counters the class imbalance problem by refining the anchors and enriched features to improve classification and regression. Experiments are performed on two benchmarks: PASCAL VOC and MS COCO. For a 320×320 input on MS COCO test-dev, our detector achieves state-of-the-art single-stage detection accuracy with a COCO AP of 33.2 under single-scale inference, while running in 21 milliseconds on a Titan XP GPU. For a 512×512 input on MS COCO test-dev, our approach obtains an absolute gain of 1.6% in COCO AP over the best reported single-stage results [5]. Source code and models are available at: https://github.com/Ranchentx/EFGRNet.
Cite
Text
Nie et al. "Enriched Feature Guided Refinement Network for Object Detection." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00963
Markdown
[Nie et al. "Enriched Feature Guided Refinement Network for Object Detection." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/nie2019iccv-enriched/) doi:10.1109/ICCV.2019.00963
BibTeX
@inproceedings{nie2019iccv-enriched,
title = {{Enriched Feature Guided Refinement Network for Object Detection}},
author = {Nie, Jing and Anwer, Rao Muhammad and Cholakkal, Hisham and Khan, Fahad Shahbaz and Pang, Yanwei and Shao, Ling},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
year = {2019},
doi = {10.1109/ICCV.2019.00963},
url = {https://mlanthology.org/iccv/2019/nie2019iccv-enriched/}
}