SeaFormer: Squeeze-Enhanced Axial Transformer for Mobile Semantic Segmentation

Abstract

Since the introduction of Vision Transformers, the landscape of many computer vision tasks (e.g., semantic segmentation), which had been overwhelmingly dominated by CNNs, has recently been revolutionized. However, their computational cost and memory requirements render these methods unsuitable for mobile devices, especially for the high-resolution per-pixel semantic segmentation task. In this paper, we introduce a new method, the squeeze-enhanced Axial Transformer (SeaFormer), for mobile semantic segmentation. Specifically, we design a generic attention block characterized by the formulation of squeeze Axial and spatial enhancement. It can be further used to create a family of backbone architectures with superior cost-effectiveness. Coupled with a light segmentation head, these backbones achieve state-of-the-art results on the ADE20K, Pascal Context and COCO-stuff datasets. Critically, we beat both the mobile-friendly rivals and Transformer-based counterparts with better performance and lower latency without bells and whistles. Beyond semantic segmentation, we further apply the proposed SeaFormer architecture to the image classification problem, demonstrating its potential to serve as a versatile mobile-friendly backbone.
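
The abstract does not spell out the attention block, so the following is only an illustrative sketch of one plausible reading of the "squeeze Axial" idea: average-pool the feature map along each spatial axis, run self-attention over the resulting row and column sequences, and broadcast the two results back over the full grid. The function names, the identity Q/K/V projections, the single head, and the omission of the spatial enhancement branch are all assumptions made for brevity, not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def axial_self_attention(seq):
    """Single-head self-attention over a (length, channels) sequence.

    Assumption: identity Q/K/V projections keep the sketch short;
    a real block would use learned projections.
    """
    scores = seq @ seq.T / np.sqrt(seq.shape[-1])
    return softmax(scores, axis=-1) @ seq

def squeeze_axial_attention(x):
    """Hypothetical squeeze-axial step on a feature map x of shape (H, W, C)."""
    rows = x.mean(axis=1)                 # squeeze the width  -> (H, C)
    cols = x.mean(axis=0)                 # squeeze the height -> (W, C)
    row_out = axial_self_attention(rows)  # attention over H tokens
    col_out = axial_self_attention(cols)  # attention over W tokens
    # Broadcast both axial results back to the full (H, W, C) grid.
    return row_out[:, None, :] + col_out[None, :, :]

if __name__ == "__main__":
    feature_map = np.random.default_rng(0).normal(size=(4, 6, 8))
    print(squeeze_axial_attention(feature_map).shape)
```

In this sketch, attention runs over H tokens and W tokens separately, so the score matrices have H² + W² entries in total rather than the (HW)² of full self-attention over every pixel, which is the kind of saving that matters for high-resolution inputs on mobile hardware.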

Cite

Text

Wan et al. "SeaFormer: Squeeze-Enhanced Axial Transformer for Mobile Semantic Segmentation." International Conference on Learning Representations, 2023.

Markdown

[Wan et al. "SeaFormer: Squeeze-Enhanced Axial Transformer for Mobile Semantic Segmentation." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/wan2023iclr-seaformer/)

BibTeX

@inproceedings{wan2023iclr-seaformer,
  title     = {{SeaFormer: Squeeze-Enhanced Axial Transformer for Mobile Semantic Segmentation}},
  author    = {Wan, Qiang and Huang, Zilong and Lu, Jiachen and Yu, Gang and Zhang, Li},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/wan2023iclr-seaformer/}
}