RINTAW: A Robust Invisible Watermark for Tabular Generative Models

Abstract

Watermarking tabular generative models is critical for preventing misuse of synthetic tabular data. However, existing watermarking methods for tabular data often lack robustness against common attacks (e.g., row shuffling) or are limited to specific data types (e.g., numerical), restricting their practical utility. To address these challenges, we propose \modelname, a novel watermarking framework for tabular generative models that is robust to common attacks while preserving data fidelity. \modelname embeds watermarks by leveraging a subset of column values as seeds. To ensure the pseudorandomness of the watermark key, \modelname employs an adaptive column selection strategy and a masking mechanism to enforce distribution uniformity. This approach guarantees minimal distortion to the original data distribution and is compatible with any tabular data format (numerical, categorical, or mixed) and generative model architecture. We validate \modelname on six real-world tabular datasets, demonstrating that the quality of watermarked tables remains nearly indistinguishable from non-watermarked ones while achieving high detectability even under strong post-editing attacks. The code is available at this \href{https://github.com/fangliancheng/RINTAW}link.

Cite

Text

Fang et al. "RINTAW: A Robust Invisible Watermark for Tabular Generative Models." ICLR 2025 Workshops: WMARK, 2025.

Markdown

[Fang et al. "RINTAW: A Robust Invisible Watermark for Tabular Generative Models." ICLR 2025 Workshops: WMARK, 2025.](https://mlanthology.org/iclrw/2025/fang2025iclrw-rintaw/)

BibTeX

@inproceedings{fang2025iclrw-rintaw,
  title     = {{RINTAW: A Robust Invisible Watermark for Tabular Generative Models}},
  author    = {Fang, Liancheng and Liu, Aiwei and Zou, Henry Peng and Zhang, Hengrui and Yu, Philip S.},
  booktitle = {ICLR 2025 Workshops: WMARK},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/fang2025iclrw-rintaw/}
}