Birdsnap: Large-Scale Fine-Grained Visual Categorization of Birds

Abstract

We address the problem of large-scale fine-grained visual categorization, describing new methods we have used to produce an online field guide to 500 North American bird species. We focus on the challenges raised when such a system is asked to distinguish between highly similar species of birds. First, we introduce "one-vs-most classifiers." By eliminating highly similar species during training, these classifiers achieve more accurate and intuitive results than common one-vs-all classifiers. Second, we show how to estimate spatio-temporal class priors from observations that are sampled at irregular and biased locations. We show how these priors can be used to significantly improve performance. We then show state-of-the-art recognition performance on a new, large dataset that we make publicly available. These recognition methods are integrated into the online field guide, which is also publicly available.

Cite

Text

Berg et al. "Birdsnap: Large-Scale Fine-Grained Visual Categorization of Birds." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.259

Markdown

[Berg et al. "Birdsnap: Large-Scale Fine-Grained Visual Categorization of Birds." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/berg2014cvpr-birdsnap/) doi:10.1109/CVPR.2014.259

BibTeX

@inproceedings{berg2014cvpr-birdsnap,
  title     = {{Birdsnap: Large-Scale Fine-Grained Visual Categorization of Birds}},
  author    = {Berg, Thomas and Liu, Jiongxin and Lee, Seung Woo and Alexander, Michelle L. and Jacobs, David W. and Belhumeur, Peter N.},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2014},
  doi       = {10.1109/CVPR.2014.259},
  url       = {https://mlanthology.org/cvpr/2014/berg2014cvpr-birdsnap/}
}