Meta-Class Features for Large-Scale Object Categorization on a Budget

Abstract

In this paper we introduce a novel image descriptor enabling accurate object categorization even with linear models. Akin to the popular attribute descriptors, our feature vector comprises the outputs of a set of classifiers evaluated on the image. However, unlike traditional attributes, which represent hand-selected object classes and predefined visual properties, our features are learned automatically and correspond to "abstract" categories, which we name meta-classes. Each meta-class is a super-category obtained by grouping a set of object classes such that, collectively, they are easy to distinguish from other sets of categories. By using "learnability" of the meta-classes as the criterion for feature generation, we obtain a set of attributes that encode general visual properties shared by multiple object classes and that are effective in describing and recognizing even novel categories, i.e., classes not present in the training set. We demonstrate that simple linear SVMs trained on our meta-class descriptor significantly outperform the best known classifier on the Caltech256 benchmark. We also present results on the 2010 ImageNet Challenge database, where our system produces results approaching those of the best systems, but at a much lower computational cost.
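The descriptor described in the abstract (a vector of classifier outputs, one per meta-class) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy 2-D data, and the hand-picked groupings in `meta_classes` are all hypothetical, and ridge regression to ±1 targets stands in for the SVMs mentioned in the abstract. The paper learns its groupings automatically using a learnability criterion, which this sketch does not attempt.

```python
import numpy as np

def train_ridge_classifier(X, y, reg=1e-2):
    """Fit a linear scorer (w, b) by ridge regression to +/-1 targets.
    Assumption: a simple stand-in for an SVM; the bias is regularized too."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])   # append a bias column
    A = Xb.T @ Xb + reg * np.eye(Xb.shape[1])
    wb = np.linalg.solve(A, Xb.T @ y)
    return wb[:-1], wb[-1]

def train_meta_class_bank(X, labels, meta_classes, reg=1e-2):
    """Train one binary classifier per meta-class (a group of class ids)."""
    W, b = [], []
    for group in meta_classes:
        # Positive examples: images whose class belongs to the group.
        y = np.where(np.isin(labels, list(group)), 1.0, -1.0)
        w, bias = train_ridge_classifier(X, y, reg)
        W.append(w)
        b.append(bias)
    return np.array(W), np.array(b)

def meta_class_descriptor(X, W, b):
    """Descriptor = raw outputs of every meta-class classifier."""
    return X @ W.T + b

# Toy data (hypothetical): four classes as Gaussian blobs in 2-D.
rng = np.random.default_rng(0)
centers = np.array([[-3.0, -3.0], [-3.0, 3.0], [3.0, -3.0], [3.0, 3.0]])
labels = np.repeat(np.arange(4), 25)
X = centers[labels] + 0.5 * rng.standard_normal((100, 2))

# Hand-picked groupings, for illustration only.
meta_classes = [(0, 1), (0, 2)]
W, b = train_meta_class_bank(X, labels, meta_classes)
D = meta_class_descriptor(X, W, b)   # one row per image, one column per meta-class
```

In this sketch `D` would then serve as the input to a final linear classifier, and a class unseen while training the bank can still be described by its responses to the existing meta-class classifiers.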

Cite

Text

Bergamo and Torresani. "Meta-Class Features for Large-Scale Object Categorization on a Budget." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6248040

Markdown

[Bergamo and Torresani. "Meta-Class Features for Large-Scale Object Categorization on a Budget." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/bergamo2012cvpr-meta/) doi:10.1109/CVPR.2012.6248040

BibTeX

@inproceedings{bergamo2012cvpr-meta,
  title     = {{Meta-Class Features for Large-Scale Object Categorization on a Budget}},
  author    = {Bergamo, Alessandro and Torresani, Lorenzo},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2012},
  pages     = {3085--3092},
  doi       = {10.1109/CVPR.2012.6248040},
  url       = {https://mlanthology.org/cvpr/2012/bergamo2012cvpr-meta/}
}