SPLeaP: Soft Pooling of Learned Parts for Image Classification
Abstract
The aggregation of image statistics – the so-called pooling step of image classification algorithms – as well as the construction of part-based models, are two distinct and well-studied topics in the literature. The former aims at leveraging a whole set of local descriptors that an image can contain (through spatial pyramids or Fisher vectors for instance) while the latter argues that only a few of the regions an image contains are actually useful for its classification. This paper bridges the two worlds by proposing a new pooling framework based on the discovery of useful parts involved in the pooling of local representations. The key contribution lies in a model integrating a boosted non-linear part classifier as well as a parametric soft-max pooling component, both trained jointly with the image classifier. The experimental validation shows that the proposed model not only consistently surpasses standard pooling approaches but also improves over state-of-the-art part-based models, on several different and challenging classification tasks.
Cite
Text
Kulkarni et al. "SPLeaP: Soft Pooling of Learned Parts for Image Classification." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46484-8_20Markdown
[Kulkarni et al. "SPLeaP: Soft Pooling of Learned Parts for Image Classification." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/kulkarni2016eccv-spleap/) doi:10.1007/978-3-319-46484-8_20BibTeX
@inproceedings{kulkarni2016eccv-spleap,
title = {{SPLeaP: Soft Pooling of Learned Parts for Image Classification}},
author = {Kulkarni, Praveen and Jurie, Frédéric and Zepeda, Joaquin and Pérez, Patrick and Chevallier, Louis},
booktitle = {European Conference on Computer Vision},
year = {2016},
pages = {329-345},
doi = {10.1007/978-3-319-46484-8_20},
url = {https://mlanthology.org/eccv/2016/kulkarni2016eccv-spleap/}
}