Attack-Resistant Watermarking for AIGC Image Forensics via Diffusion-Based Semantic Deflection

Abstract

Protecting the copyright of user-generated AI images is an emerging challenge as AIGC becomes pervasive in creative workflows. Existing watermarking methods (1) remain vulnerable to real-world adversarial threats, often forced to trade off between defenses against spoofing and removal attacks; and (2) cannot support semantic-level tamper localization. We introduce PAI, a training-free inherent watermarking framework for AIGC copyright protection, plug-and-play with diffusion-based AIGC services. PAI simultaneously provides three key functionalities: robust ownership verification, attack detection, and semantic-level tampering localization. Unlike existing inherent watermark methods that only embed watermarks at noise initialization of diffusion models, we design a novel key-conditioned deflection mechanism that subtly steers the denoising trajectory according to the user key. Such trajectory-level coupling further strengthens the semantic entanglement of identity and content, thereby further enhancing robustness against real-world threats. Moreover, we also provide a theoretical analysis proving that only the valid key can pass verification. Experiments across 12 attack methods show that PAI achieves 98.43\% verification accuracy, improving over SOTA methods by 37.25\% on average, and retains strong tampering localization performance even against advanced AIGC edits. Our code is available at \url{https://github.com/QingyuLiu/PAI}.

Cite

Text

Liu et al. "Attack-Resistant Watermarking for AIGC Image Forensics via Diffusion-Based Semantic Deflection." International Conference on Learning Representations, 2026.

Markdown

[Liu et al. "Attack-Resistant Watermarking for AIGC Image Forensics via Diffusion-Based Semantic Deflection." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/liu2026iclr-attackresistant/)

BibTeX

@inproceedings{liu2026iclr-attackresistant,
  title     = {{Attack-Resistant Watermarking for AIGC Image Forensics via Diffusion-Based Semantic Deflection}},
  author    = {Liu, Qingyu and Zhang, Yitao and Ba, Zhongjie and Shuai, Chao and Cheng, Peng and Zheng, Tianhang and Wang, Zhibo},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/liu2026iclr-attackresistant/}
}