When Fast Fourier Transform Meets Transformer for Image Restoration
Abstract
Natural images can suffer from various degradation phenomena caused by adverse atmospheric conditions or unique degradation mechanism. Such diversity makes it challenging to design a universal framework for kinds of restoration tasks. Instead of exploring the commonality across different degradation phenomena, existing image restoration methods focus on the modification of network architecture under limited restoration priors. In this work, we first review various degradation phenomena from a frequency perspective as prior. Based on this, we propose an efficient image restoration framework, dubbed SFHformer, which incorporates the Fast Fourier Transform mechanism into Transformer architecture. Specifically, we design a dual domain hybrid structure for multi-scale receptive fields modeling, in which the spatial domain and the frequency domain focuses on local modeling and global modeling, respectively. Moreover, we design unique positional coding and frequency dynamic convolution for each frequency component to extract rich frequency-domain features. Extensive experiments on thirty-one restoration datasets for a range of ten restoration tasks such as deraining, dehazing, deblurring, desnowing, denoising, super-resolution and underwater/low-light enhancement, demonstrate that our SFHformer surpasses the state-of-the-art approaches and achieves a favorable trade-off between performance, parameter size and computational cost. The code is available at: https://github.com/deng-ai-lab/SFHformer.
Cite
Text
Jiang et al. "When Fast Fourier Transform Meets Transformer for Image Restoration." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72995-9_22Markdown
[Jiang et al. "When Fast Fourier Transform Meets Transformer for Image Restoration." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/jiang2024eccv-fast/) doi:10.1007/978-3-031-72995-9_22BibTeX
@inproceedings{jiang2024eccv-fast,
title = {{When Fast Fourier Transform Meets Transformer for Image Restoration}},
author = {Jiang, Xingyu and Zhang, Xiuhui and Gao, Ning and Deng, Yue},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72995-9_22},
url = {https://mlanthology.org/eccv/2024/jiang2024eccv-fast/}
}