DeepBox: Learning Objectness with Convolutional Networks

Weicheng Kuo, Bharath Hariharan, Jitendra Malik

ICCV 2015

doi:10.1109/ICCV.2015.285 /iccv/2015/kuo2015iccv-deepbox/

Abstract

Existing object proposal approaches use primarily bottom-up cues to rank proposals, while we believe that "objectness" is in fact a high level construct. We argue for a data-driven, semantic approach for ranking object proposals. Our framework, which we call DeepBox, uses convolutional neural networks (CNNs) to rerank proposals from a bottom-up method. We use a novel four-layer CNN architecture that is as good as much larger networks on the task of evaluating objectness while being much faster. We show that DeepBox significantly improves over the bottom-up ranking, achieving the same recall with 500 proposals as achieved by bottom-up methods with 2000. This improvement generalizes to categories the CNN has never seen before and leads to a 4.5-point gain in detection mAP. Our implementation achieves this performance while running at 260 ms per image.

PDF ICCV Semantic Scholar

Cite

Text

Kuo et al. "DeepBox: Learning Objectness with Convolutional Networks." International Conference on Computer Vision, 2015. doi:10.1109/ICCV.2015.285

Markdown

[Kuo et al. "DeepBox: Learning Objectness with Convolutional Networks." International Conference on Computer Vision, 2015.](https://mlanthology.org/iccv/2015/kuo2015iccv-deepbox/) doi:10.1109/ICCV.2015.285

BibTeX

@inproceedings{kuo2015iccv-deepbox,
  title     = {{DeepBox: Learning Objectness with Convolutional Networks}},
  author    = {Kuo, Weicheng and Hariharan, Bharath and Malik, Jitendra},
  booktitle = {International Conference on Computer Vision},
  year      = {2015},
  doi       = {10.1109/ICCV.2015.285},
  url       = {https://mlanthology.org/iccv/2015/kuo2015iccv-deepbox/}
}