Blind Image Inpainting via Omni-Dimensional Gated Attention and Wavelet Queries

Abstract

Blind image inpainting is a crucial restoration task that does not demand additional mask information to restore the corrupted regions. Yet, it is a very less explored research area due to the difficulty in discriminating between corrupted and valid regions. There exist very few approaches for blind image inpainting which sometimes fail at producing plausible inpainted images. Since they follow a common practice of predicting the corrupted regions and then inpaint them. To skip the corrupted region prediction step and obtain better results, in this work, we propose a novel end-to-end architecture for blind image inpainting consisting of wavelet query multi-head attention transformer block and the omni-dimensional gated attention. The proposed wavelet query multi-head attention in the transformer block provides encoder features via processed wavelet coefficients as query to the multi-head attention. Further, the proposed omni-dimensional gated attention effectively provides all dimensional attentive features from the encoder to the respective decoder. Our proposed approach is compared numerically and visually with existing state-of-the-art methods for blind image inpainting on different standard datasets. The comparative and ablation studies prove the effectiveness of the proposed approach for blind image inpainting. The testing code is available at : https://github.com/shrutiphutke/Blind_Omni_Wav_Net

Cite

Text

Phutke et al. "Blind Image Inpainting via Omni-Dimensional Gated Attention and Wavelet Queries." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00132

Markdown

[Phutke et al. "Blind Image Inpainting via Omni-Dimensional Gated Attention and Wavelet Queries." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/phutke2023cvprw-blind/) doi:10.1109/CVPRW59228.2023.00132

BibTeX

@inproceedings{phutke2023cvprw-blind,
  title     = {{Blind Image Inpainting via Omni-Dimensional Gated Attention and Wavelet Queries}},
  author    = {Phutke, Shruti S. and Kulkarni, Ashutosh and Vipparthi, Santosh Kumar and Murala, Subrahmanyam},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2023},
  pages     = {1251-1260},
  doi       = {10.1109/CVPRW59228.2023.00132},
  url       = {https://mlanthology.org/cvprw/2023/phutke2023cvprw-blind/}
}