SAIR: Learning Semantic-Aware Implicit Representation

Abstract

Implicit representation of an image can map arbitrary coordinates in the continuous domain to their corresponding color values, presenting a powerful capability for image reconstruction. Nevertheless, existing implicit representation approaches only focus on building continuous appearance mapping, ignoring the continuities of the semantic information across pixels. Consequently, achieving the desired reconstruction results becomes challenging when the semantic information within input image is corrupted, such as when a large region is missing. To address the issue, we suggest learning semantic-aware implicit representation ( SAIR), that is, we make the implicit representation of each pixel rely on both its appearance and semantic information (, which object does the pixel belong to). To this end, we propose a framework with two modules: (1) a semantic implicit representation (SIR) for a corrupted image. Given an arbitrary coordinate in the continuous domain, we can obtain its respective text-aligned embedding indicating the object the pixel belongs. (2) an appearance implicit representation (AIR) based on the SIR. Given an arbitrary coordinate in the continuous domain, we can reconstruct its color whether or not the pixel is missed in the input. We validate the novel semantic-aware implicit representation method on the image inpainting task, and the extensive experiments demonstrate that our method surpasses state-of-the-art approaches by a significant margin.

Cite

Text

Zhang et al. "SAIR: Learning Semantic-Aware Implicit Representation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73235-5_18

Markdown

[Zhang et al. "SAIR: Learning Semantic-Aware Implicit Representation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhang2024eccv-sair/) doi:10.1007/978-3-031-73235-5_18

BibTeX

@inproceedings{zhang2024eccv-sair,
  title     = {{SAIR: Learning Semantic-Aware Implicit Representation}},
  author    = {Zhang, Canyu and Li, Xiaoguang and Guo, Qing and Wang, Song},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73235-5_18},
  url       = {https://mlanthology.org/eccv/2024/zhang2024eccv-sair/}
}