DiffFNO: Diffusion Fourier Neural Operator
Abstract
We introduce DiffFNO, a novel diffusion framework for arbitrary-scale super-resolution strengthened by a Weighted Fourier Neural Operator (WFNO). Mode Rebalancing in WFNO effectively captures critical frequency components, significantly improving the reconstruction of high-frequency image details that are crucial for super-resolution tasks. Gated Fusion Mechanism (GFM) adaptively complements WFNO's spectral features with spatial features from an Attention-based Neural Operator (AttnNO). This enhances the network's capability to capture both global structures and local details. Adaptive Time-Step (ATS) ODE solver, a deterministic sampling strategy, accelerates inference without sacrificing output quality by dynamically adjusting integration step sizes ATS. Extensive experiments demonstrate that DiffFNO achieves state-of-the-art (SOTA) results, outperforming existing methods across various scaling factors by a margin of 2-4 dB in PSNR, including those beyond the training distribution. It also achieves this at lower inference time. Our approach sets a new standard in super-resolution, delivering both superior accuracy and computational efficiency.
Cite
Text
Liu and Tang. "DiffFNO: Diffusion Fourier Neural Operator." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.00023Markdown
[Liu and Tang. "DiffFNO: Diffusion Fourier Neural Operator." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/liu2025cvpr-difffno/) doi:10.1109/CVPR52734.2025.00023BibTeX
@inproceedings{liu2025cvpr-difffno,
title = {{DiffFNO: Diffusion Fourier Neural Operator}},
author = {Liu, Xiaoyi and Tang, Hao},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2025},
pages = {150-160},
doi = {10.1109/CVPR52734.2025.00023},
url = {https://mlanthology.org/cvpr/2025/liu2025cvpr-difffno/}
}