Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons

Abstract

Deep Convolutional Neural Networks (CNNs) are playing important roles in state-of-the-art visual recognition. This paper focuses on modeling the spatial co-occurrence of neuron responses, which is less studied in the previous work. For this, we consider the neurons in the hidden layer as neural words, and construct a set of geometric neural phrases on top of them. The idea that grouping neural words into neural phrases is borrowed from the Bag-of-Visual-Words (BoVW) model. Next, the Geometric Neural Phrase Pooling (GNPP) algorithm is proposed to efficiently encode these neural phrases. GNPP acts as a new type of hidden layer, which punishes the isolated neuron responses after convolution, and can be inserted into a CNN model with little extra computational overhead. Experimental results show that GNPP produces significant and consistent accuracy gain in image classification.

Cite

Text

Xie et al. "Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46448-0_39

Markdown

[Xie et al. "Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/xie2016eccv-geometric/) doi:10.1007/978-3-319-46448-0_39

BibTeX

@inproceedings{xie2016eccv-geometric,
  title     = {{Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons}},
  author    = {Xie, Lingxi and Tian, Qi and Flynn, John and Wang, Jingdong and Yuille, Alan L.},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {645-661},
  doi       = {10.1007/978-3-319-46448-0_39},
  url       = {https://mlanthology.org/eccv/2016/xie2016eccv-geometric/}
}