Soft-NMS -- Improving Object Detection with One Line of Code
Abstract
Non-maximum suppression is an integral part of the object detection pipeline. First, it sorts all detection boxes on the basis of their scores. The detection box M with the maximum score is selected and all other detection boxes with a significant overlap (using a pre-defined threshold) with M are suppressed. This process is recursively applied on the remaining boxes. As per the design of the algorithm, if an object lies within the predefined overlap threshold, it leads to a miss. To this end, we propose Soft-NMS, an algorithm which decays the detection scores of all other objects as a continuous function of their overlap with M. Hence, no object is eliminated in this process. Soft-NMS obtains consistent improvements for the coco-style mAP metric on standard datasets like PASCAL VOC 2007 (1.7% for both R-FCN and Faster-RCNN) and MS-COCO (1.3% for R-FCN and 1.1% for Faster-RCNN) by just changing the NMS algorithm without any additional hyper-parameters. Using Deformable-RFCN, Soft-NMS improves state-of-the-art in object detection from 39.8% to 40.9% with a single model. Further, the computational complexity of Soft-NMS is the same as traditional NMS and hence it can be efficiently implemented. Since Soft-NMS does not require any extra training and is simple to implement, it can be easily integrated into any object detection pipeline. Code for Soft-NMS is publicly available on GitHub
Cite
Text
Bodla et al. "Soft-NMS -- Improving Object Detection with One Line of Code." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.593Markdown
[Bodla et al. "Soft-NMS -- Improving Object Detection with One Line of Code." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/bodla2017iccv-softnms/) doi:10.1109/ICCV.2017.593BibTeX
@inproceedings{bodla2017iccv-softnms,
title = {{Soft-NMS -- Improving Object Detection with One Line of Code}},
author = {Bodla, Navaneeth and Singh, Bharat and Chellappa, Rama and Davis, Larry S.},
booktitle = {International Conference on Computer Vision},
year = {2017},
doi = {10.1109/ICCV.2017.593},
url = {https://mlanthology.org/iccv/2017/bodla2017iccv-softnms/}
}