Is a Detector Only Good for Detection?

Abstract

A common design of an object recognition system has two steps, a detection step followed by a foreground within-class classification step. For example, consider face detection by a boosted cascade of detectors followed by face ID recognition via one-vs-all (OVA) classifiers. Another example is human detection followed by pose recognition. Although the detection step can be quite fast, the foreground within-class classification process can be slow and becomes a bottleneck. In this work, we formulate a filter-and-refine scheme, where the binary outputs of the weak classifiers in a boosted detector are used to identify a small number of candidate foreground state hypotheses quickly via Hamming distance or weighted Hamming distance. The approach is evaluated in three applications: face recognition on the FRGC V2 data set, hand shape detection and parameter estimation on a hand data set and vehicle detection and view angle estimation on a multi-view vehicle data set. On all data sets, our approach has comparable accuracy and is at least five times faster than the brute force approach.

Cite

Text

Yuan and Sclaroff. "Is a Detector Only Good for Detection?." IEEE/CVF International Conference on Computer Vision, 2009. doi:10.1109/ICCV.2009.5459389

Markdown

[Yuan and Sclaroff. "Is a Detector Only Good for Detection?." IEEE/CVF International Conference on Computer Vision, 2009.](https://mlanthology.org/iccv/2009/yuan2009iccv-detector/) doi:10.1109/ICCV.2009.5459389

BibTeX

@inproceedings{yuan2009iccv-detector,
  title     = {{Is a Detector Only Good for Detection?}},
  author    = {Yuan, Quan and Sclaroff, Stan},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2009},
  pages     = {1066-1073},
  doi       = {10.1109/ICCV.2009.5459389},
  url       = {https://mlanthology.org/iccv/2009/yuan2009iccv-detector/}
}