CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI

Abstract

With the rapid advancement of generative AI, it is now possible to synthesize high-quality images in a few seconds. Despite the power of these technologies, they raise significant concerns regarding misuse. Current efforts to distinguish between real and AI-generated images may lack generalization, being effective for only certain types of generative models and susceptible to post-processing techniques like JPEG compression. To overcome these limitations, we propose a novel framework, CO-SPY, that first enhances existing semantic features (e.g., the number of fingers in a hand) and artifact features (e.g., pixel value differences), and then adaptively integrates them to achieve more general and robust synthetic image detection. Additionally, we create CO-SPYBench, a comprehensive dataset comprising 5 real image datasets and 22 state-of-the-art generative models, including the latest models like FLUX. We also collect 50k synthetic images in the wild from the Internet to enable evaluation in a more practical setting. Our extensive evaluations demonstrate that our detector outperforms existing methods under identical training conditions, achieving an average accuracy improvement of approximately 11% to 34%.

Cite

Text

Cheng et al. "CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01256

Markdown

[Cheng et al. "CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/cheng2025cvpr-cospy/) doi:10.1109/CVPR52734.2025.01256

BibTeX

@inproceedings{cheng2025cvpr-cospy,
  title     = {{CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI}},
  author    = {Cheng, Siyuan and Lyu, Lingjuan and Wang, Zhenting and Zhang, Xiangyu and Sehwag, Vikash},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {13455-13465},
  doi       = {10.1109/CVPR52734.2025.01256},
  url       = {https://mlanthology.org/cvpr/2025/cheng2025cvpr-cospy/}
}