Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

Abstract

To mitigate potential risks associated with language models (LMs), recent AI detection research proposes incorporating watermarks into machine-generated text through random vocabulary restrictions and utilizing this information for detection. In this paper, we show that watermarking algorithms designed for LMs cannot be seamlessly applied to conditional text generation (CTG) tasks without a notable decline in downstream task performance. To address this issue, we introduce a simple yet effective semantic-aware watermarking algorithm that considers the characteristics of conditional text generation with the input context. Compared to the baseline watermarks, our proposed watermark yields significant improvements in both automatic and human evaluations across various text generation models, including BART and Flan-T5, for CTG tasks such as summarization and data-to-text generation. Meanwhile, it maintains detection ability with higher z-scores but lower AUC scores, suggesting the presence of a detection paradox that poses additional challenges for watermarking CTG.
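The abstract refers to watermarks built from "random vocabulary restrictions" and to detection via z-scores without giving the mechanics. As a rough illustration only, the sketch below assumes a green-list style baseline: the previous token seeds a pseudo-random split of the vocabulary, generation is restricted to the "green" part, and detection runs a one-proportion z-test on the number of green tokens. It is not the paper's semantic-aware algorithm. The function names, the SHA-256 seeding, the toy vocabulary size, and `gamma = 0.5` are all assumptions made for this example.

```python
import hashlib
import math
import random
from typing import List, Set


def green_list(prev_token: int, vocab_size: int, gamma: float) -> Set[int]:
    """Hypothetical helper: pick a pseudo-random 'green' subset of the
    vocabulary, seeded by the previous token id (an assumed seeding scheme)."""
    digest = hashlib.sha256(str(prev_token).encode()).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    return set(rng.sample(range(vocab_size), int(gamma * vocab_size)))


def toy_hard_watermark(length: int, vocab_size: int, gamma: float = 0.5) -> List[int]:
    """Toy stand-in for a generator: always emit a green token.
    A real model would pick among green tokens by probability."""
    tokens = [0]  # arbitrary start token
    for _ in range(length):
        tokens.append(min(green_list(tokens[-1], vocab_size, gamma)))
    return tokens


def z_score(tokens: List[int], vocab_size: int, gamma: float = 0.5) -> float:
    """One-proportion z-test: how far the green-token count deviates from
    the gamma * T expected in unwatermarked text."""
    scored = len(tokens) - 1  # the first token has no predecessor
    if scored <= 0:
        raise ValueError("need at least two tokens")
    hits = sum(
        tokens[i] in green_list(tokens[i - 1], vocab_size, gamma)
        for i in range(1, len(tokens))
    )
    return (hits - gamma * scored) / math.sqrt(scored * gamma * (1 - gamma))


if __name__ == "__main__":
    watermarked = toy_hard_watermark(50, vocab_size=100)
    print(f"z-score of toy watermarked sequence: {z_score(watermarked, 100):.2f}")
```

In this toy setting every scored token is green, so the z-score grows with the square root of the sequence length. For conditional generation, where much of the output is constrained by the input, restricting the vocabulary at random is what the abstract identifies as the source of degraded task performance.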

Cite

Text

Fu et al. "Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I16.29756

Markdown

[Fu et al. "Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/fu2024aaai-watermarking/) doi:10.1609/AAAI.V38I16.29756

BibTeX

@inproceedings{fu2024aaai-watermarking,
  title     = {{Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy}},
  author    = {Fu, Yu and Xiong, Deyi and Dong, Yue},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {18003--18011},
  doi       = {10.1609/AAAI.V38I16.29756},
  url       = {https://mlanthology.org/aaai/2024/fu2024aaai-watermarking/}
}