Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition
Abstract
Compared to earlier multistage frameworks using CNN features, recent end-to-end deep approaches for fine-grained recognition essentially enhance the mid-level learning capability of CNNs. Previous approaches achieve this by introducing an auxiliary network to infuse localization information into the main classification network, or a sophisticated feature encoding method to capture higher order feature statistics. We show that mid-level representation learning can be enhanced within the CNN framework, by learning a bank of convolutional filters that capture class-specific discriminative patches without extra part or bounding box annotations. Such a filter bank is well structured, properly initialized and discriminatively learned through a novel asymmetric multi-stream architecture with convolutional filter supervision and a non-random layer initialization. Experimental results show that our approach achieves state-of-the-art on three publicly available fine-grained recognition datasets (CUB-200-2011, Stanford Cars and FGVC-Aircraft). Ablation studies and visualizations are further provided to understand our approach.
Cite
Text
Wang et al. "Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00436Markdown
[Wang et al. "Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/wang2018cvpr-learning-a/) doi:10.1109/CVPR.2018.00436BibTeX
@inproceedings{wang2018cvpr-learning-a,
title = {{Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition}},
author = {Wang, Yaming and Morariu, Vlad I. and Davis, Larry S.},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2018},
doi = {10.1109/CVPR.2018.00436},
url = {https://mlanthology.org/cvpr/2018/wang2018cvpr-learning-a/}
}