SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution

Abstract

Transformer-based restoration methods achieve significant performance as the self-attention (SA) of the Transformer can explore non-local information for better high-resolution image reconstruction. However, the key dot-product SA requires substantial computational resources, which limits its application in low-power devices. Moreover, the low-pass nature of the SA mechanism limits its capacity for capturing local details, consequently leading to smooth reconstruction results. To address these issues, we propose a self-modulation feature aggregation (SMFA) module to collaboratively exploit both local and non-local feature interactions for a more accurate reconstruction. Specifically, the SMFA module employs an efficient approximation of self-attention (EASA) branch to model non-local information and uses a local detail estimation (LDE) branch to capture local details. Additionally, we further introduce a partial convolution-based feed-forward network (PCFN) to refine the representative features derived from the SMFA. Extensive experiments show that the proposed SMFANet family achieve a better trade-off between reconstruction performance and computational efficiency on public benchmark datasets. In particular, compared to the ×4 SwinIR-light, SMFANet+ achieves 0.14dB higher performance over five public testsets on average, and ×10 times faster runtime, with only about 43% of the model complexity (e.g., FLOPs). Our source codes and pre-trained models are available at: https://github.com/Zheng-MJ/SMFANet.

Cite

Text

Zheng et al. "SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72973-7_21

Markdown

[Zheng et al. "SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zheng2024eccv-smfanet/) doi:10.1007/978-3-031-72973-7_21

BibTeX

@inproceedings{zheng2024eccv-smfanet,
  title     = {{SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution}},
  author    = {Zheng, Mingjun and Sun, Long and Dong, Jiangxin and Pan, Jinshan},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72973-7_21},
  url       = {https://mlanthology.org/eccv/2024/zheng2024eccv-smfanet/}
}