Bidirectional Dilation Transformer for Multispectral and Hyperspectral Image Fusion

Abstract

Transformer-based methods have proven to be effective in achieving long-distance modeling, capturing the spatial and spectral information, and exhibiting strong inductive bias in various computer vision tasks. Generally, the Transformer model includes two common modes of multi-head self-attention (MSA): spatial MSA (Spa-MSA) and spectral MSA (Spe-MSA). However, Spa-MSA is computationally efficient but limits the global spatial response within a local window. On the other hand, Spe-MSA can calculate channel self-attention to accommodate high-resolution images, but it disregards the crucial local information that is essential for low-level vision tasks. In this study, we propose a bidirectional dilation Transformer (BDT) for multispectral and hyperspectral image fusion (MHIF), which aims to leverage the advantages of both MSA and the latent multiscale information specific to MHIF tasks. The BDT consists of two designed modules: the dilation Spa-MSA (D-Spa), which dynamically expands the spatial receptive field through a given hollow strategy, and the grouped Spe-MSA (G-Spe), which extracts latent features within the feature map and learns local data behavior. Additionally, to fully exploit the multiscale information from both inputs with different spatial resolutions, we employ a bidirectional hierarchy strategy in the BDT, resulting in improved performance. Finally, extensive experiments on two commonly used datasets, CAVE and Harvard, demonstrate the superiority of BDT both visually and quantitatively. Furthermore, the related code will be available at the GitHub page of the authors.

Cite

Text

Deng et al. "Bidirectional Dilation Transformer for Multispectral and Hyperspectral Image Fusion." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/404

Markdown

[Deng et al. "Bidirectional Dilation Transformer for Multispectral and Hyperspectral Image Fusion." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/deng2023ijcai-bidirectional/) doi:10.24963/IJCAI.2023/404

BibTeX

@inproceedings{deng2023ijcai-bidirectional,
  title     = {{Bidirectional Dilation Transformer for Multispectral and Hyperspectral Image Fusion}},
  author    = {Deng, Shangqi and Deng, Liang-Jian and Wu, Xiao and Ran, Ran and Wen, Rui},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {3633-3641},
  doi       = {10.24963/IJCAI.2023/404},
  url       = {https://mlanthology.org/ijcai/2023/deng2023ijcai-bidirectional/}
}