Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

Abstract

Spatial pooling has been proven highly effective to capture long-range contextual information for pixel-wise prediction tasks, such as scene parsing. In this paper, beyond conventional spatial pooling that usually has a regular shape of NxN, we rethink the formulation of spatial pooling by introducing a new pooling strategy, called strip pooling, which considers a long but narrow kernel, i.e., 1xN or Nx1. Based on strip pooling, we further investigate spatial pooling architecture design by 1) introducing a new strip pooling module that enables backbone networks to efficiently model long-range dependencies; 2) presenting a novel building block with diverse spatial pooling as a core; and 3) systematically comparing the performance of the proposed strip pooling and conventional spatial pooling techniques. Both novel pooling-based designs are lightweight and can serve as an efficient plug-and-play modules in existing scene parsing networks. Extensive experiments on Cityscapes and ADE20K benchmarks demonstrate that our simple approach establishes new state-of-the-art results. Code is available at https://github.com/Andrew-Qibin/SPNet.

Cite

Text

Hou et al. "Strip Pooling: Rethinking Spatial Pooling for Scene Parsing." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00406

Markdown

[Hou et al. "Strip Pooling: Rethinking Spatial Pooling for Scene Parsing." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/hou2020cvpr-strip/) doi:10.1109/CVPR42600.2020.00406

BibTeX

@inproceedings{hou2020cvpr-strip,
  title     = {{Strip Pooling: Rethinking Spatial Pooling for Scene Parsing}},
  author    = {Hou, Qibin and Zhang, Li and Cheng, Ming-Ming and Feng, Jiashi},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00406},
  url       = {https://mlanthology.org/cvpr/2020/hou2020cvpr-strip/}
}