Boosting ViT-Based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification
Abstract
The accelerated MRI reconstruction process presents a challenging ill-posed inverse problem due to the extensive under-sampling in k-space. Recently, Vision Transformers (ViTs) have become the mainstream for this task, demonstrating substantial performance improvements. However, there are still three significant issues remain unaddressed: (1) ViTs struggle to capture high-frequency components of images, limiting their ability to detect local textures and edge information, thereby impeding MRI restoration; (2) Previous methods calculate multi-head self-attention (MSA) among both related and unrelated tokens in content, introducing noise and significantly increasing computational burden; (3) The naive feed-forward network in ViTs cannot model the multi-scale information that is important for image restoration. In this paper, we propose FPS-Former, a powerful ViT-based framework, to address these issues from the perspectives of frequency modulation, spatial purification, and scale diversification. Specifically, for issue (1), we introduce a frequency modulation attention module to enhance the self-attention map by adaptively re-calibrating the frequency information in a Laplacian pyramid. For issue (2), we customize a spatial purification attention module to capture interactions among closely related tokens, thereby reducing redundant or irrelevant feature representations. For issue (3), we propose an efficient feed-forward network based on a hybrid-scale fusion strategy. Comprehensive experiments conducted on three public datasets show that our FPS-Former outperforms state-of-the-art methods while requiring lower computational costs.
Cite
Text
Meng et al. "Boosting ViT-Based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I6.32656Markdown
[Meng et al. "Boosting ViT-Based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/meng2025aaai-boosting/) doi:10.1609/AAAI.V39I6.32656BibTeX
@inproceedings{meng2025aaai-boosting,
title = {{Boosting ViT-Based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification}},
author = {Meng, Yucong and Yang, Zhiwei and Shi, Yonghong and Song, Zhijian},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {6135-6143},
doi = {10.1609/AAAI.V39I6.32656},
url = {https://mlanthology.org/aaai/2025/meng2025aaai-boosting/}
}