WaterDrum: Watermark-Based Data-Centric Unlearning Metric

Lu, Xinyang; Niu, Xinyuan; Lau, Gregory Kang Ruey; Bui, Nhung; Sim, Rachael Hwee Ling; Himawan, John Russell; Wen, Fanyu; Foo, Chuan-Sheng; Ng, See-Kiong; Low, Bryan Kian Hsiang

WaterDrum: Watermark-Based Data-Centric Unlearning Metric

Xinyang Lu, Xinyuan Niu, Gregory Kang Ruey Lau, Nhung Bui, Rachael Hwee Ling Sim, John Russell Himawan, Fanyu Wen, Chuan-Sheng Foo, See-Kiong Ng, Bryan Kian Hsiang Low

ICLR 2026

/iclr/2026/lu2026iclr-waterdrum/

Abstract

Large language model (LLM) unlearning is critical in real-world applications where it is necessary to efficiently remove the influence of private, copyrighted, or harmful data from some users. Existing utility-centric unlearning metrics (based on model utility) may fail to accurately evaluate the extent of unlearning in realistic settings such as when the forget and retain sets have semantically similar content and/or retraining the model from scratch on the retain set is impractical. This paper presents the first data-centric unlearning metric for LLMs called WaterDrum that exploits robust text watermarking to overcome these limitations. We introduce new benchmark datasets (with different levels of data similarity) for LLM unlearning that can be used to rigorously evaluate unlearning algorithms via WaterDrum. Our code is available on [Github](https://github.com/lululu008/WaterDrum) and our new benchmark datasets are released on [HuggingFace](https://huggingface.co/datasets/Glow-AI/WaterDrum-Ax).

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Lu et al. "WaterDrum: Watermark-Based Data-Centric Unlearning Metric." International Conference on Learning Representations, 2026.

Markdown

[Lu et al. "WaterDrum: Watermark-Based Data-Centric Unlearning Metric." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/lu2026iclr-waterdrum/)

BibTeX

@inproceedings{lu2026iclr-waterdrum,
  title     = {{WaterDrum: Watermark-Based Data-Centric Unlearning Metric}},
  author    = {Lu, Xinyang and Niu, Xinyuan and Lau, Gregory Kang Ruey and Bui, Nhung and Sim, Rachael Hwee Ling and Himawan, John Russell and Wen, Fanyu and Foo, Chuan-Sheng and Ng, See-Kiong and Low, Bryan Kian Hsiang},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/lu2026iclr-waterdrum/}
}