Object Detection in 20 Questions

Abstract

We propose a novel general strategy for object detection. Instead of passively evaluating all object detectors at all possible locations in an image, we develop a divide-and-conquer approach by actively and sequentially evaluating contextual cues related to the query based on the scene and previous evaluations — like playing a "20 Questions" game — to decide where to search for the object. We formulate the problem as a Markov Decision Process and learn a search policy by reinforcement learning. To demonstrate the efficacy of our generic algorithm, we apply the 20 questions approach in the recent framework of simultaneous object detection and segmentation. Experimental results on the Pascal VOC dataset show that our algorithm reduces about 45.3% of the object proposals and 36% of average evaluation time while achieving better average precision compared to exhaustive search.

Cite

Text

Chen et al. "Object Detection in 20 Questions." IEEE/CVF Winter Conference on Applications of Computer Vision, 2016. doi:10.1109/WACV.2016.7477562

Markdown

[Chen et al. "Object Detection in 20 Questions." IEEE/CVF Winter Conference on Applications of Computer Vision, 2016.](https://mlanthology.org/wacv/2016/chen2016wacv-object/) doi:10.1109/WACV.2016.7477562

BibTeX

@inproceedings{chen2016wacv-object,
  title     = {{Object Detection in 20 Questions}},
  author    = {Chen, Xi Stephen and He, He and Davis, Larry S.},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2016},
  pages     = {1-9},
  doi       = {10.1109/WACV.2016.7477562},
  url       = {https://mlanthology.org/wacv/2016/chen2016wacv-object/}
}