Similarity Comparisons for Interactive Fine-Grained Categorization

Abstract

Current human-in-the-loop fine-grained visual categorization systems depend on a predefined vocabulary of attributes and parts, usually determined by experts. In this work, we move away from that expert-driven and attribute-centric paradigm and present a novel interactive classification system that incorporates computer vision and perceptual similarity metrics in a unified framework. At test time, users are asked to judge relative similarity between a query image and various sets of images; these general queries do not require expert-defined terminology and are applicable to other domains and basic-level categories, enabling a flexible, efficient, and scalable system for fine-grained categorization with humans in the loop. Our system outperforms existing state-of-the-art systems for relevance feedback-based image retrieval as well as interactive classification, resulting in a reduction of up to 43% in the average number of questions needed to correctly classify an image.

Cite

Text

Wah et al. "Similarity Comparisons for Interactive Fine-Grained Categorization." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.115

Markdown

[Wah et al. "Similarity Comparisons for Interactive Fine-Grained Categorization." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/wah2014cvpr-similarity/) doi:10.1109/CVPR.2014.115

BibTeX

@inproceedings{wah2014cvpr-similarity,
  title     = {{Similarity Comparisons for Interactive Fine-Grained Categorization}},
  author    = {Wah, Catherine and Van Horn, Grant and Branson, Steve and Maji, Subhransu and Perona, Pietro and Belongie, Serge},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2014},
  doi       = {10.1109/CVPR.2014.115},
  url       = {https://mlanthology.org/cvpr/2014/wah2014cvpr-similarity/}
}