AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images

Abstract

The semantic segmentation of agricultural aerial images is very important for the recognition and analysis of farmland anomaly patterns, such as drydown, endrow, nutrient deficiency, etc. Methods for general semantic segmentation such as Fully Convolutional Networks can extract rich semantic features, but are difficult to exploit the long-range information. Recently, vision Transformer architectures have made outstanding performances in image segmentation tasks, but transformer-based models have not been fully explored in the field of agriculture.Therefore, we propose a novel architecture called Agricultural Aerial Transformer (AAFormer) to solve the semantic segmentation of aerial farmland images. We adopt Mix Transformer (MiT) in the encoder stage to enhance the ability of field anomaly pattern recognition and leverage the Squeeze-and-Excitation (SE) module in the decoder stage to improve the effectiveness of key channels. The boundary maps of farmland are introduced into the decoder. Evaluated on the Agriculture-Vision validation set, the mIoU of our proposed model reaches 45.44%.

Cite

Text

Shen et al. "AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00177

Markdown

[Shen et al. "AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/shen2022cvprw-aaformer/) doi:10.1109/CVPRW56347.2022.00177

BibTeX

@inproceedings{shen2022cvprw-aaformer,
  title     = {{AAFormer: A Multi-Modal Transformer Network for Aerial Agricultural Images}},
  author    = {Shen, Yao and Wang, Lei and Jin, Yue},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2022},
  pages     = {1704-1710},
  doi       = {10.1109/CVPRW56347.2022.00177},
  url       = {https://mlanthology.org/cvprw/2022/shen2022cvprw-aaformer/}
}