Raising the Bar of AI-Generated Image Detection with CLIP

Abstract

The aim of this work is to explore the potential of pre-trained vision-language models (VLMs) for universal detection of AI-generated images. We develop a lightweight detection strategy based on CLIP features and study its performance in a wide variety of challenging scenarios. We find that, contrary to previous beliefs, it is neither necessary nor convenient to use a large domain-specific dataset for training. On the contrary, by using only a handful of example images from a single generative model, a CLIP-based detector exhibits surprising generalization ability and high robustness across different architectures, including recent commercial tools such as Dalle-3, Midjourney v5, and Firefly. We match the state-of-the-art (SoTA) on in-distribution data and significantly improve upon it in terms of generalization to out-of-distribution data (+6% AUC) and robustness to impaired/laundered data (+13%). Our project is available at https://grip-unina.github.io/ClipBased-SyntheticImageDetection/

Cite

Text

Cozzolino et al. "Raising the Bar of AI-Generated Image Detection with CLIP." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00439

Markdown

[Cozzolino et al. "Raising the Bar of AI-Generated Image Detection with CLIP." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/cozzolino2024cvprw-raising/) doi:10.1109/CVPRW63382.2024.00439

BibTeX

@inproceedings{cozzolino2024cvprw-raising,
  title     = {{Raising the Bar of AI-Generated Image Detection with CLIP}},
  author    = {Cozzolino, Davide and Poggi, Giovanni and Corvi, Riccardo and Nießner, Matthias and Verdoliva, Luisa},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {4356-4366},
  doi       = {10.1109/CVPRW63382.2024.00439},
  url       = {https://mlanthology.org/cvprw/2024/cozzolino2024cvprw-raising/}
}