Part-Based R-CNNs for Fine-Grained Category Detection
Abstract
Semantic part localization can facilitate fine-grained categorization by explicitly isolating subtle appearance differences associated with specific object parts. Methods for pose-normalized representations have been proposed, but generally presume bounding box annotations at test time due to the difficulty of object detection. We propose a model for fine-grained categorization that overcomes these limitations by leveraging deep convolutional features computed on bottom-up region proposals. Our method learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a fine-grained category from a pose-normalized representation. Experiments on the Caltech-UCSD bird dataset confirm that our method outperforms state-of-the-art fine-grained categorization methods in an end-to-end evaluation without requiring a bounding box at test time.
Cite
Text
Zhang et al. "Part-Based R-CNNs for Fine-Grained Category Detection." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10590-1_54Markdown
[Zhang et al. "Part-Based R-CNNs for Fine-Grained Category Detection." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/zhang2014eccv-part/) doi:10.1007/978-3-319-10590-1_54BibTeX
@inproceedings{zhang2014eccv-part,
title = {{Part-Based R-CNNs for Fine-Grained Category Detection}},
author = {Zhang, Ning and Donahue, Jeff and Girshick, Ross B. and Darrell, Trevor},
booktitle = {European Conference on Computer Vision},
year = {2014},
pages = {834-849},
doi = {10.1007/978-3-319-10590-1_54},
url = {https://mlanthology.org/eccv/2014/zhang2014eccv-part/}
}