Seeking the Strongest Rigid Detector

Abstract

The current state of the art solutions for object detection describe each class by a set of models trained on discovered sub-classes (so called "components"), with each model itself composed of collections of interrelated parts (deformable models). These detectors build upon the now classic Histogram of Oriented Gradients+linear SVM combo. In this paper we revisit some of the core assumptions in HOG+SVM and show that by properly designing the feature pooling, feature selection, preprocessing, and training methods, it is possible to reach top quality, at least for pedestrian detections, using a single rigid component. We provide experiments for a large design space, that give insights into the design of classifiers, as well as relevant information for practitioners. Our best detector is fully feed-forward, has a single unified architecture, uses only histograms of oriented gradients and colour information in monocular static images, and improves over 23 other methods on the INRIA, ETH and Caltech-USA datasets, reducing the average miss-rate over HOG+SVM by more than 30%.

Cite

Text

Benenson et al. "Seeking the Strongest Rigid Detector." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.470

Markdown

[Benenson et al. "Seeking the Strongest Rigid Detector." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/benenson2013cvpr-seeking/) doi:10.1109/CVPR.2013.470

BibTeX

@inproceedings{benenson2013cvpr-seeking,
  title     = {{Seeking the Strongest Rigid Detector}},
  author    = {Benenson, Rodrigo and Mathias, Markus and Tuytelaars, Tinne and Van Gool, Luc},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2013},
  doi       = {10.1109/CVPR.2013.470},
  url       = {https://mlanthology.org/cvpr/2013/benenson2013cvpr-seeking/}
}