Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons
Abstract
Deep Convolutional Neural Networks (CNNs) are playing important roles in state-of-the-art visual recognition. This paper focuses on modeling the spatial co-occurrence of neuron responses, which is less studied in the previous work. For this, we consider the neurons in the hidden layer as neural words, and construct a set of geometric neural phrases on top of them. The idea that grouping neural words into neural phrases is borrowed from the Bag-of-Visual-Words (BoVW) model. Next, the Geometric Neural Phrase Pooling (GNPP) algorithm is proposed to efficiently encode these neural phrases. GNPP acts as a new type of hidden layer, which punishes the isolated neuron responses after convolution, and can be inserted into a CNN model with little extra computational overhead. Experimental results show that GNPP produces significant and consistent accuracy gain in image classification.
Cite
Text
Xie et al. "Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46448-0_39Markdown
[Xie et al. "Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/xie2016eccv-geometric/) doi:10.1007/978-3-319-46448-0_39BibTeX
@inproceedings{xie2016eccv-geometric,
title = {{Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons}},
author = {Xie, Lingxi and Tian, Qi and Flynn, John and Wang, Jingdong and Yuille, Alan L.},
booktitle = {European Conference on Computer Vision},
year = {2016},
pages = {645-661},
doi = {10.1007/978-3-319-46448-0_39},
url = {https://mlanthology.org/eccv/2016/xie2016eccv-geometric/}
}