Object Detection Using a Max-Margin Hough Transform

Abstract

We present a discriminative Hough transform based object detector where each local part casts a weighted vote for the possible locations of the object center. We show that the weights can be learned in a max-margin framework which directly optimizes the classification performance. The discriminative training takes into account both the codebook appearance and the spatial distribution of its position with respect to the object center to derive its importance. On various datasets we show that the discriminative training improves the Hough detector. Combined with a verification step using a SVM based classifier, our approach achieves a detection rate of 91.9% at 0.3 false positives per image on the ETHZ shape dataset, a significant improvement over the state of the art, while running the verification step on at least an order of magnitude fewer windows than in a sliding window approach.

Cite

Text

Maji and Malik. "Object Detection Using a Max-Margin Hough Transform." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206693

Markdown

[Maji and Malik. "Object Detection Using a Max-Margin Hough Transform." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/maji2009cvpr-object/) doi:10.1109/CVPR.2009.5206693

BibTeX

@inproceedings{maji2009cvpr-object,
  title     = {{Object Detection Using a Max-Margin Hough Transform}},
  author    = {Maji, Subhransu and Malik, Jitendra},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2009},
  pages     = {1038-1045},
  doi       = {10.1109/CVPR.2009.5206693},
  url       = {https://mlanthology.org/cvpr/2009/maji2009cvpr-object/}
}