Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons

Xie, Lingxi; Tian, Qi; Flynn, John; Wang, Jingdong; Yuille, Alan L.

doi:10.1007/978-3-319-46448-0_39

ECCV 2016 pp. 645-661

Abstract

Deep Convolutional Neural Networks (CNNs) play an important role in state-of-the-art visual recognition. This paper focuses on modeling the spatial co-occurrence of neuron responses, which has received little attention in previous work. To this end, we treat the neurons in a hidden layer as neural words and construct a set of geometric neural phrases on top of them. The idea of grouping neural words into neural phrases is borrowed from the Bag-of-Visual-Words (BoVW) model. Next, the Geometric Neural Phrase Pooling (GNPP) algorithm is proposed to efficiently encode these neural phrases. GNPP acts as a new type of hidden layer that penalizes isolated neuron responses after convolution, and it can be inserted into a CNN model with little extra computational overhead. Experimental results show that GNPP produces significant and consistent accuracy gains in image classification.
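
The abstract does not give the pooling formula, so the following is only an illustrative sketch of how a layer could favor spatially co-occurring responses over isolated ones. It assumes, hypothetically, that each neural phrase is a central response plus its 8 immediate neighbors, and that the phrase is encoded as the central response plus a scaled maximum of the neighbors. The function name `gnpp_sketch`, the parameter `sigma`, and the 3x3 neighborhood are assumptions made for illustration, not details taken from this record.

```python
def gnpp_sketch(feature_map, sigma=0.5):
    """Illustrative pooling over one 2D channel (a list of lists of floats).

    ASSUMPTION: each output is the central response plus `sigma` times the
    largest response among its in-bounds 3x3 neighbors. This is a sketch,
    not the formulation from the paper.
    """
    rows, cols = len(feature_map), len(feature_map[0])
    pooled = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Collect the neighboring ("side") responses, excluding the center.
            side = [
                feature_map[rr][cc]
                for rr in range(max(0, r - 1), min(rows, r + 2))
                for cc in range(max(0, c - 1), min(cols, c + 2))
                if (rr, cc) != (r, c)
            ]
            # A 1x1 map has no neighbors, so fall back to 0.0.
            pooled[r][c] = feature_map[r][c] + sigma * max(side, default=0.0)
    return pooled


# An isolated response keeps its value, while a response with an active
# neighbor is boosted, so isolated responses are penalized in relative terms.
isolated = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
paired = [[0.0, 0.0, 0.0], [0.0, 1.0, 1.0], [0.0, 0.0, 0.0]]
isolated_center = gnpp_sketch(isolated)[1][1]
paired_center = gnpp_sketch(paired)[1][1]
```

Under these assumptions the operation adds no learnable parameters and costs one neighborhood scan per position, which is in the spirit of the low overhead the abstract describes.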

Cite

Text

Xie et al. "Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46448-0_39

Markdown

[Xie et al. "Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/xie2016eccv-geometric/) doi:10.1007/978-3-319-46448-0_39

BibTeX

@inproceedings{xie2016eccv-geometric,
  title     = {{Geometric Neural Phrase Pooling: Modeling the Spatial Co-Occurrence of Neurons}},
  author    = {Xie, Lingxi and Tian, Qi and Flynn, John and Wang, Jingdong and Yuille, Alan L.},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {645--661},
  doi       = {10.1007/978-3-319-46448-0_39},
  url       = {https://mlanthology.org/eccv/2016/xie2016eccv-geometric/}
}