SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting

Abstract

Recent advancements in generative AI have made text-guided image inpainting--adding, removing, or altering image regions using textual prompts--widely accessible. However, generating semantically correct photorealistic imagery, typically requires carefully-crafted prompts and iterative refinement by evaluating the realism of the generated content - tasks commonly performed by humans. To automate the generative process, we propose Semantically Aligned and Uncertainty Guided AI Image Inpainting (SAGI), a model-agnostic pipeline, to sample prompts from a distribution that closely aligns with human perception and to evaluate the generated content and discard instances that deviate from such a distribution, which we approximate using pretrained large language models and vision-language models. By applying this pipeline on multiple state-of-the-art inpainting models, we create the SAGI Dataset SAGI-D, currently the largest and most diverse dataset of AI-generated inpaintings, comprising over 95k inpainted images and a human-evaluated subset. Our experiments show that semantic alignment significantly improves image quality and aesthetics, while uncertainty guidance effectively identifies realistic manipulations -- human ability to distinguish inpainted images from real ones drops from 74% to 35% in terms of accuracy, after applying our pipeline. Moreover, using SAGI-D for training several image forensic approaches increases in-domain detection performance on average by 37.4% and out-of-domain generalization by 26.1% in terms of IoU, also demonstrating its utility in countering malicious exploitation of generative AI. Code and dataset are available at https://mever-team.github.io/SAGI/

Cite

Text

Giakoumoglou et al. "SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting." International Conference on Computer Vision, 2025.

Markdown

[Giakoumoglou et al. "SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/giakoumoglou2025iccv-sagi/)

BibTeX

@inproceedings{giakoumoglou2025iccv-sagi,
  title     = {{SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting}},
  author    = {Giakoumoglou, Paschalis and Karageorgiou, Dimitrios and Papadopoulos, Symeon and Petrantonakis, Panagiotis C.},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {16090-16101},
  url       = {https://mlanthology.org/iccv/2025/giakoumoglou2025iccv-sagi/}
}