Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance

Abstract

Subordinate-level categorization typically rests on establishing salient distinctions between part-level characteristics of objects, in contrast to basic-level categorization, where the presence or absence of parts is determinative. We develop an approach for subordinate categorization in vision, focusing on an avian domain due to the fine-grained structure of the category taxonomy for this domain. We explore a pose-normalized appearance model based on a volumetric poselet scheme. The variation in shape and appearance properties of these parts across a taxonomy provides the cues needed for subordinate categorization. Training pose detectors requires a relatively large amount of training data per category when done from scratch; using a subordinate-level approach, we exploit a pose classifier trained at the basic-level, and extract part appearance and shape information to build subordinate-level models. Our model associates the underlying image pattern parameters used for detection with corresponding volumetric part location, scale and orientation parameters. These parameters implicitly define a mapping from the image pixels into a pose-normalized appearance space, removing view and pose dependencies, facilitating fine-grained categorization from relatively few training examples.

Cite

Text

Farrell et al. "Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126238

Markdown

[Farrell et al. "Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/farrell2011iccv-birdlets/) doi:10.1109/ICCV.2011.6126238

BibTeX

@inproceedings{farrell2011iccv-birdlets,
  title     = {{Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance}},
  author    = {Farrell, Ryan and Oza, Om and Zhang, Ning and Morariu, Vlad I. and Darrell, Trevor and Davis, Larry S.},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2011},
  pages     = {161-168},
  doi       = {10.1109/ICCV.2011.6126238},
  url       = {https://mlanthology.org/iccv/2011/farrell2011iccv-birdlets/}
}