Squeeze-and-Attention Networks for Semantic Segmentation

Zhong, Zilong; Lin, Zhong Qiu; Bidart, Rene; Hu, Xiaodan; Daya, Ibrahim Ben; Li, Zhifeng; Zheng, Wei-Shi; Li, Jonathan; Wong, Alexander

doi:10.1109/CVPR42600.2020.01308

Squeeze-and-Attention Networks for Semantic Segmentation

Zilong Zhong, Zhong Qiu Lin, Rene Bidart, Xiaodan Hu, Ibrahim Ben Daya, Zhifeng Li, Wei-Shi Zheng, Jonathan Li, Alexander Wong

CVPR 2020

doi:10.1109/CVPR42600.2020.01308 /cvpr/2020/zhong2020cvpr-squeezeandattention/

Abstract

The recent integration of attention mechanisms into segmentation networks improves their representational capabilities through a great emphasis on more informative features. However, these attention mechanisms ignore an implicit sub-task of semantic segmentation and are constrained by the grid structure of convolution kernels. In this paper, we propose a novel squeeze-and-attention network (SANet) architecture that leverages an effective squeeze-and-attention (SA) module to account for two distinctive characteristics of segmentation: i) pixel-group attention, and ii) pixel-wise prediction. Specifically, the proposed SA modules impose pixel-group attention on conventional convolution by introducing an 'attention' convolutional channel, thus taking into account spatial-channel inter-dependencies in an efficient manner. The final segmentation results are produced by merging outputs from four hierarchical stages of a SANet to integrate multi-scale contexts for obtaining an enhanced pixel-wise prediction. Empirical experiments on two challenging public datasets validate the effectiveness of the proposed SANets, which achieves 83.2 % mIoU (without COCO pre-training) on PASCAL VOC and a state-of-the-art mIoU of 54.4 % on PASCAL Context.

PDF CVPR Semantic Scholar

Cite

Text

Zhong et al. "Squeeze-and-Attention Networks for Semantic Segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01308

Markdown

[Zhong et al. "Squeeze-and-Attention Networks for Semantic Segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/zhong2020cvpr-squeezeandattention/) doi:10.1109/CVPR42600.2020.01308

BibTeX

@inproceedings{zhong2020cvpr-squeezeandattention,
  title     = {{Squeeze-and-Attention Networks for Semantic Segmentation}},
  author    = {Zhong, Zilong and Lin, Zhong Qiu and Bidart, Rene and Hu, Xiaodan and Daya, Ibrahim Ben and Li, Zhifeng and Zheng, Wei-Shi and Li, Jonathan and Wong, Alexander},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.01308},
  url       = {https://mlanthology.org/cvpr/2020/zhong2020cvpr-squeezeandattention/}
}