Rethinking Fast Fourier Convolution in Image Inpainting

Abstract

Recently proposed image inpainting method LaMa builds its network upon Fast Fourier Convolution (FFC), which was originally proposed for high-level vision tasks like image classification. FFC empowers the fully convolutional network to have a global receptive field in its early layers. Thanks to the unique character of the FFC module, LaMa has the ability to produce robust repeating texture, which can not be achieved by the previous inpainting methods. However, is the vanilla FFC module suitable for low-level vision tasks like image inpainting? In this paper, we analyze the fundamental flaws of using FFC in image inpainting, which are 1) spectrum shifting, 2) unexpected spatial activation, and 3) limited frequency receptive field. Such flaws make FFC-based inpainting framework difficult in generating complicated texture and performing faithful reconstruction. Based on the above analysis, we propose a novel Unbiased Fast Fourier Convolution (UFFC) module, which modifies the vanilla FFC module with 1) range transform and inverse transform, 2) absolute position embedding, 3) dynamic skip connection, and 4) adaptive clip, to overcome such flaws, achieving better inpainting results. Extensive experiments on several benchmark datasets demonstrate the effectiveness of our method, outperforming the state-of-the-art methods in both texture-capturing ability and expressiveness.

Cite

Text

Chu et al. "Rethinking Fast Fourier Convolution in Image Inpainting." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.02120

Markdown

[Chu et al. "Rethinking Fast Fourier Convolution in Image Inpainting." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/chu2023iccv-rethinking/) doi:10.1109/ICCV51070.2023.02120

BibTeX

@inproceedings{chu2023iccv-rethinking,
  title     = {{Rethinking Fast Fourier Convolution in Image Inpainting}},
  author    = {Chu, Tianyi and Chen, Jiafu and Sun, Jiakai and Lian, Shuobin and Wang, Zhizhong and Zuo, Zhiwen and Zhao, Lei and Xing, Wei and Lu, Dongming},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {23195-23205},
  doi       = {10.1109/ICCV51070.2023.02120},
  url       = {https://mlanthology.org/iccv/2023/chu2023iccv-rethinking/}
}