A Codebook-Free and Annotation-Free Approach for Fine-Grained Image Categorization

Abstract

Fine-grained categorization refers to the task of classifying objects that belong to the same basic-level class (e.g. different bird species) and share similar shape or visual appearances. Most of the state-of-the-art basic-level object classification algorithms have difficulties in this challenging problem. One reason for this can be attributed to the popular codebook-based image representation, often resulting in loss of subtle image information that are critical for fine-grained classification. Another way to address this problem is to introduce human annotations of object attributes or key points, a tedious process that is also difficult to generalize to new tasks. In this work, we propose a codebook-free and annotation-free approach for fine-grained image categorization. Instead of using vectorquantized codewords, we obtain an image representation by running a high throughput template matching process using a large number of randomly generated image templates. We then propose a novel bagging-based algorithm to build a final classifier by aggregating a set of discriminative yet largely uncorrelated classifiers. Experimental results show that our method outperforms state-of-the-art classification approaches on the Caltech-UCSD Birds dataset.

Cite

Text

Yao et al. "A Codebook-Free and Annotation-Free Approach for Fine-Grained Image Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6248088

Markdown

[Yao et al. "A Codebook-Free and Annotation-Free Approach for Fine-Grained Image Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/yao2012cvpr-codebook/) doi:10.1109/CVPR.2012.6248088

BibTeX

@inproceedings{yao2012cvpr-codebook,
  title     = {{A Codebook-Free and Annotation-Free Approach for Fine-Grained Image Categorization}},
  author    = {Yao, Bangpeng and Bradski, Gary R. and Fei-Fei, Li},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2012},
  pages     = {3466-3473},
  doi       = {10.1109/CVPR.2012.6248088},
  url       = {https://mlanthology.org/cvpr/2012/yao2012cvpr-codebook/}
}