Fine-Grained Recognition Without Part Annotations

Abstract

Scaling up fine-grained recognition to all domains of fine-grained objects is a challenge the computer vision community will need to face in order to realize its goal of recognizing all object categories. Current state-of-the-art techniques rely heavily upon the use of keypoint or part annotations, but scaling up to hundreds or thousands of domains renders this annotation cost-prohibitive for all but the most important categories. In this work we propose a method for fine-grained recognition that uses no part annotations. Our method is based on generating parts using co-segmentation and alignment, which we combine in a discriminative mixture. Experimental results show its efficacy, demonstrating state-of-the-art results even when compared to methods that use part annotations during training.

Cite

Text

Krause et al. "Fine-Grained Recognition Without Part Annotations." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7299194

Markdown

[Krause et al. "Fine-Grained Recognition Without Part Annotations." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/krause2015cvpr-finegrained/) doi:10.1109/CVPR.2015.7299194

BibTeX

@inproceedings{krause2015cvpr-finegrained,
  title     = {{Fine-Grained Recognition Without Part Annotations}},
  author    = {Krause, Jonathan and Jin, Hailin and Yang, Jianchao and Fei-Fei, Li},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7299194},
  url       = {https://mlanthology.org/cvpr/2015/krause2015cvpr-finegrained/}
}