A Design Principle for Coarse-to-Fine Classification

Abstract

Coarse-to-fine classification is an efficient way of organizing object recognition in order to accommodate a large number of possible hypotheses and to systematically exploit shared attributes and the hierarchical nature of the visual world. The basic structure is a nested representation of the space of hypotheses and a corresponding hierarchy of (binary) classifiers. In existing work, the representation is manually crafted. Here we introduce a design principle for recursively learning the representation and the classifiers together. This also unifies previous work on cascades and tree-structured search. The criterion for deciding when a group of hypotheses should be "retested" (a cascade) versus partitioned into smaller groups ("divide-and-conquer") is motivated by recent theoretical work on optimal search strategies. The key concept is the cost-to-power ratio of a classifier. The learned hierarchy consists of both linear cascades and branching segments and outperforms manual ones in experiments on face detection.

Cite

Text

Gangaputra and Geman. "A Design Principle for Coarse-to-Fine Classification." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2006. doi:10.1109/CVPR.2006.21

Markdown

[Gangaputra and Geman. "A Design Principle for Coarse-to-Fine Classification." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2006.](https://mlanthology.org/cvpr/2006/gangaputra2006cvpr-design/) doi:10.1109/CVPR.2006.21

BibTeX

@inproceedings{gangaputra2006cvpr-design,
  title     = {{A Design Principle for Coarse-to-Fine Classification}},
  author    = {Gangaputra, Sachin and Geman, Donald},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2006},
  pages     = {1877-1884},
  doi       = {10.1109/CVPR.2006.21},
  url       = {https://mlanthology.org/cvpr/2006/gangaputra2006cvpr-design/}
}