Mask-Guided Spectral-Wise Transformer for Efficient Hyperspectral Image Reconstruction
Abstract
Hyperspectral image (HSI) reconstruction aims to recover the 3D spatial-spectral signal from a 2D measurement in the coded aperture snapshot spectral imaging (CASSI) system. The HSI representations are highly similar and correlated across the spectral dimension. Modeling the inter-spectra interactions is beneficial for HSI reconstruction. However, existing CNN-based methods show limitations in capturing spectral-wise similarity and long-range dependencies. Besides, the HSI information is modulated by a coded aperture (physical mask) in CASSI. Nonetheless, current algorithms have not fully explored the guidance effect of the mask for HSI restoration. In this paper, we propose a novel framework, Mask-guided Spectral-wise Transformer (MST), for HSI reconstruction. Specifically, we present a Spectral-wise Multi-head Self-Attention (S-MSA) that treats each spectral feature as a token and calculates self-attention along the spectral dimension. In addition, we customize a Mask-guided Mechanism (MM) that directs S-MSA to pay attention to spatial regions with high-fidelity spectral representations. Extensive experiments show that our MST significantly outperforms state-of-the-art (SOTA) methods on simulation and real HSI datasets while requiring dramatically cheaper computational and memory costs. https://github.com/caiyuanhao1998/MST/
Cite
Text
Cai et al. "Mask-Guided Spectral-Wise Transformer for Efficient Hyperspectral Image Reconstruction." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.01698Markdown
[Cai et al. "Mask-Guided Spectral-Wise Transformer for Efficient Hyperspectral Image Reconstruction." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/cai2022cvpr-maskguided/) doi:10.1109/CVPR52688.2022.01698BibTeX
@inproceedings{cai2022cvpr-maskguided,
title = {{Mask-Guided Spectral-Wise Transformer for Efficient Hyperspectral Image Reconstruction}},
author = {Cai, Yuanhao and Lin, Jing and Hu, Xiaowan and Wang, Haoqian and Yuan, Xin and Zhang, Yulun and Timofte, Radu and Van Gool, Luc},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {17502-17511},
doi = {10.1109/CVPR52688.2022.01698},
url = {https://mlanthology.org/cvpr/2022/cai2022cvpr-maskguided/}
}