Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance
Abstract
Subordinate-level categorization typically rests on establishing salient distinctions between part-level characteristics of objects, in contrast to basic-level categorization, where the presence or absence of parts is determinative. We develop an approach for subordinate categorization in vision, focusing on an avian domain due to the fine-grained structure of the category taxonomy for this domain. We explore a pose-normalized appearance model based on a volumetric poselet scheme. The variation in shape and appearance properties of these parts across a taxonomy provides the cues needed for subordinate categorization. Training pose detectors requires a relatively large amount of training data per category when done from scratch; using a subordinate-level approach, we exploit a pose classifier trained at the basic-level, and extract part appearance and shape information to build subordinate-level models. Our model associates the underlying image pattern parameters used for detection with corresponding volumetric part location, scale and orientation parameters. These parameters implicitly define a mapping from the image pixels into a pose-normalized appearance space, removing view and pose dependencies, facilitating fine-grained categorization from relatively few training examples.
Cite
Text
Farrell et al. "Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance." IEEE/CVF International Conference on Computer Vision, 2011. doi:10.1109/ICCV.2011.6126238Markdown
[Farrell et al. "Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance." IEEE/CVF International Conference on Computer Vision, 2011.](https://mlanthology.org/iccv/2011/farrell2011iccv-birdlets/) doi:10.1109/ICCV.2011.6126238BibTeX
@inproceedings{farrell2011iccv-birdlets,
title = {{Birdlets: Subordinate Categorization Using Volumetric Primitives and Pose-Normalized Appearance}},
author = {Farrell, Ryan and Oza, Om and Zhang, Ning and Morariu, Vlad I. and Darrell, Trevor and Davis, Larry S.},
booktitle = {IEEE/CVF International Conference on Computer Vision},
year = {2011},
pages = {161-168},
doi = {10.1109/ICCV.2011.6126238},
url = {https://mlanthology.org/iccv/2011/farrell2011iccv-birdlets/}
}