Your Text Encoder Can Be an Object-Level Watermarking Controller

Abstract

Invisible watermarking of AI-generated images can help with copyright protection, enabling detection and identification of AI-generated media. In this work, we present a novel approach to watermark images of T2I Latent Diffusion Models (LDMs). By only fine-tuning text token embeddings \mathcal W _*, we enable watermarking in selected objects or parts of the image, offering greater flexibility compared to traditional full-image watermarking. Our method leverages the text encoder's compatibility across various LDMs, allowing plug-and-play integration for different LDMs. Moreover, introducing the watermark early in the encoding stage improves robustness to adversarial perturbations in later stages of the pipeline. Our approach achieves 99% bit accuracy (48 bits) with a 10^5 xreduction in model parameters, enabling efficient watermarking.

Cite

Text

Devulapally et al. "Your Text Encoder Can Be an Object-Level Watermarking Controller." International Conference on Computer Vision, 2025.

Markdown

[Devulapally et al. "Your Text Encoder Can Be an Object-Level Watermarking Controller." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/devulapally2025iccv-your/)

BibTeX

@inproceedings{devulapally2025iccv-your,
  title     = {{Your Text Encoder Can Be an Object-Level Watermarking Controller}},
  author    = {Devulapally, Naresh Kumar and Huang, Mingzhen and Asnani, Vishal and Agarwal, Shruti and Lyu, Siwei and Lokhande, Vishnu Suresh},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {16576-16585},
  url       = {https://mlanthology.org/iccv/2025/devulapally2025iccv-your/}
}