Learning and Using Taxonomies for Fast Visual Categorization

Abstract

The computational complexity of current visual categorization algorithms scales linearly at best with the number of categories. The goal of classifying simultaneously N <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">cat</sub> = 10 <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">4</sup> - 10 <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">5</sup> visual categories requires sub-linear classification costs. We explore algorithms for automatically building classification trees which have, in principle, logNcat complexity. We find that a greedy algorithm that recursively splits the set of categories into the two minimally confused subsets achieves 5-20 fold speedups at a small cost in classification performance. Our approach is independent of the specific classification algorithm used. A welcome by-product of our algorithm is a very reasonable taxonomy of the Caltech-256 dataset.

Cite

Text

Griffin and Perona. "Learning and Using Taxonomies for Fast Visual Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587410

Markdown

[Griffin and Perona. "Learning and Using Taxonomies for Fast Visual Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/griffin2008cvpr-learning/) doi:10.1109/CVPR.2008.4587410

BibTeX

@inproceedings{griffin2008cvpr-learning,
  title     = {{Learning and Using Taxonomies for Fast Visual Categorization}},
  author    = {Griffin, Gregory and Perona, Pietro},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2008},
  doi       = {10.1109/CVPR.2008.4587410},
  url       = {https://mlanthology.org/cvpr/2008/griffin2008cvpr-learning/}
}