Traceable Black-Box Watermarks for Federated Learning

Xu, Jiahao; Hu, Rui; Kotevska, Olivera; Zhang, Zikai

Traceable Black-Box Watermarks for Federated Learning

Jiahao Xu, Rui Hu, Olivera Kotevska, Zikai Zhang

ICLR 2026

/iclr/2026/xu2026iclr-traceable/

Abstract

Due to the distributed nature of Federated Learning (FL) systems, each local client has access to the global model, which poses a critical risk of model leakage. Existing works have explored injecting watermarks into local models to enable intellectual property protection. However, these methods either focus on non-traceable watermarks or traceable but white-box watermarks. We identify a gap in the literature regarding the formal definition of traceable black-box watermarking and the formulation of the problem of injecting such watermarks into FL systems. In this work, we first formalize the problem of injecting traceable black-box watermarks into FL. Based on the problem, we propose a novel server-side watermarking method, $\mathbf{TraMark}$, which creates a traceable watermarked model for each client, enabling verification of model leakage in black-box settings. To achieve this, $\mathbf{TraMark}$ partitions the model parameter space into two distinct regions: the main task region and the watermarking region. Subsequently, a personalized global model is constructed for each client by aggregating only the main task region while preserving the watermarking region. Each model then learns a unique watermark exclusively within the watermarking region using a distinct watermark dataset before being sent back to the local client. Extensive results across various FL systems demonstrate that $\mathbf{TraMark}$ ensures the traceability of all watermarked models while preserving their main task performance. The code is available at \url{https://github.com/JiiahaoXU/TraMark}.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Xu et al. "Traceable Black-Box Watermarks for Federated Learning." International Conference on Learning Representations, 2026.

Markdown

[Xu et al. "Traceable Black-Box Watermarks for Federated Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/xu2026iclr-traceable/)

BibTeX

@inproceedings{xu2026iclr-traceable,
  title     = {{Traceable Black-Box Watermarks for Federated Learning}},
  author    = {Xu, Jiahao and Hu, Rui and Kotevska, Olivera and Zhang, Zikai},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/xu2026iclr-traceable/}
}