SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution
Abstract
Transformer-based restoration methods achieve significant performance as the self-attention (SA) of the Transformer can explore non-local information for better high-resolution image reconstruction. However, the key dot-product SA requires substantial computational resources, which limits its application in low-power devices. Moreover, the low-pass nature of the SA mechanism limits its capacity for capturing local details, consequently leading to smooth reconstruction results. To address these issues, we propose a self-modulation feature aggregation (SMFA) module to collaboratively exploit both local and non-local feature interactions for a more accurate reconstruction. Specifically, the SMFA module employs an efficient approximation of self-attention (EASA) branch to model non-local information and uses a local detail estimation (LDE) branch to capture local details. Additionally, we further introduce a partial convolution-based feed-forward network (PCFN) to refine the representative features derived from the SMFA. Extensive experiments show that the proposed SMFANet family achieve a better trade-off between reconstruction performance and computational efficiency on public benchmark datasets. In particular, compared to the ×4 SwinIR-light, SMFANet+ achieves 0.14dB higher performance over five public testsets on average, and ×10 times faster runtime, with only about 43% of the model complexity (e.g., FLOPs). Our source codes and pre-trained models are available at: https://github.com/Zheng-MJ/SMFANet.
Cite
Text
Zheng et al. "SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72973-7_21Markdown
[Zheng et al. "SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zheng2024eccv-smfanet/) doi:10.1007/978-3-031-72973-7_21BibTeX
@inproceedings{zheng2024eccv-smfanet,
title = {{SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution}},
author = {Zheng, Mingjun and Sun, Long and Dong, Jiangxin and Pan, Jinshan},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72973-7_21},
url = {https://mlanthology.org/eccv/2024/zheng2024eccv-smfanet/}
}