Selective Pooling Vector for Fine-Grained Recognition
Abstract
We propose a new framework for image recognition by selectively pooling local visual descriptors, and show its superior discriminative power on fine-grained image classification tasks. The representation is based on selecting the most confident local descriptors for nonlinear function learning using a linear approximation in an embedded higher dimensional space. The advantage of our Selective Pooling Vector over the previous state-of-the-art Super Vector and Fisher Vector representations, is that it ensures a more accurate learning function, which proves to be important for classifying details in fine-grained image recognition. Our experimental results corroborate this claim: with a simple linear SVM as the classifier, the selective pooling vector achieves significant performance gains on standard benchmark datasets for various fine-grained tasks such as the CMU Multi-PIE dataset for face recognition, the Caltech-UCSD Bird dataset and the Stanford Dogs dataset for fine-grained object categorization. On all datasets we outperform the state of the arts and boost the recognition rates to 96.4%, 48.9%, 52.0% respectively.
Cite
Text
Chen et al. "Selective Pooling Vector for Fine-Grained Recognition." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015. doi:10.1109/WACV.2015.119Markdown
[Chen et al. "Selective Pooling Vector for Fine-Grained Recognition." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015.](https://mlanthology.org/wacv/2015/chen2015wacv-selective/) doi:10.1109/WACV.2015.119BibTeX
@inproceedings{chen2015wacv-selective,
title = {{Selective Pooling Vector for Fine-Grained Recognition}},
author = {Chen, Guang and Yang, Jianchao and Jin, Hailin and Shechtman, Eli and Brandt, Jonathan and Han, Tony X.},
booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
year = {2015},
pages = {860-867},
doi = {10.1109/WACV.2015.119},
url = {https://mlanthology.org/wacv/2015/chen2015wacv-selective/}
}