Understanding Objects in Detail with Fine-Grained Attributes

Abstract

We study the problem of understanding objects in detail, intended as recognizing a wide array of fine-grained object attributes. To this end, we introduce a dataset of 7,413 airplanes annotated in detail with parts and their attributes, leveraging images donated by airplane spotters and crowdsourcing both the design and collection of the detailed annotations. We provide a number of insights that should help researchers interested in designing fine-grained datasets for other basic level categories. We show that the collected data can be used to study the relation between part detection and attribute prediction by diagnosing the performance of classifiers that pool information from different parts of an object. We note that the prediction of certain attributes can benefit substantially from accurate part detection. We also show that, differently from previous results in object detection, employing a large number of part templates can improve detection accuracy at the expenses of detection speed. We finally propose a coarse-to-fine approach to speed up detection through a hierarchical cascade algorithm.

Cite

Text

Vedaldi et al. "Understanding Objects in Detail with Fine-Grained Attributes." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.463

Markdown

[Vedaldi et al. "Understanding Objects in Detail with Fine-Grained Attributes." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/vedaldi2014cvpr-understanding/) doi:10.1109/CVPR.2014.463

BibTeX

@inproceedings{vedaldi2014cvpr-understanding,
  title     = {{Understanding Objects in Detail with Fine-Grained Attributes}},
  author    = {Vedaldi, Andrea and Mahendran, Siddharth and Tsogkas, Stavros and Maji, Subhransu and Girshick, Ross and Kannala, Juho and Rahtu, Esa and Kokkinos, Iasonas and Blaschko, Matthew B. and Weiss, David and Taskar, Ben and Simonyan, Karen and Saphra, Naomi and Mohamed, Sammy},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2014},
  doi       = {10.1109/CVPR.2014.463},
  url       = {https://mlanthology.org/cvpr/2014/vedaldi2014cvpr-understanding/}
}