SubmodBoxes: Near-Optimal Search for a Set of Diverse Object Proposals

Abstract

This paper formulates the search for a set of bounding boxes (as needed in object proposal generation) as a monotone submodular maximization problem over the space of all possible bounding boxes in an image. Since the number of possible bounding boxes in an image is very large $O(#pixels^2)$, even a single linear scan to perform the greedy augmentation for submodular maximization is intractable. Thus, we formulate the greedy augmentation step as a Branch-and-Bound scheme. In order to speed up repeated application of B\&B, we propose a novel generalization of Minoux’s ‘lazy greedy’ algorithm to the B\&B tree. Theoretically, our proposed formulation provides a new understanding to the problem, and contains classic heuristic approaches such as Sliding Window+Non-Maximal Suppression (NMS) and and Efficient Subwindow Search (ESS) as special cases. Empirically, we show that our approach leads to a state-of-art performance on object proposal generation via a novel diversity measure.

Cite

Text

Sun and Batra. "SubmodBoxes: Near-Optimal Search for a Set of Diverse Object Proposals." Neural Information Processing Systems, 2015.

Markdown

[Sun and Batra. "SubmodBoxes: Near-Optimal Search for a Set of Diverse Object Proposals." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/sun2015neurips-submodboxes/)

BibTeX

@inproceedings{sun2015neurips-submodboxes,
  title     = {{SubmodBoxes: Near-Optimal Search for a Set of Diverse Object Proposals}},
  author    = {Sun, Qing and Batra, Dhruv},
  booktitle = {Neural Information Processing Systems},
  year      = {2015},
  pages     = {1378-1386},
  url       = {https://mlanthology.org/neurips/2015/sun2015neurips-submodboxes/}
}