Probabilistically Robust Watermarking of Neural Networks

Pautov, Mikhail; Bogdanov, Nikita; Pyatkin, Stanislav; Rogov, Oleg; Oseledets, Ivan V.

doi:10.24963/ijcai.2024/528

Probabilistically Robust Watermarking of Neural Networks

Mikhail Pautov, Nikita Bogdanov, Stanislav Pyatkin, Oleg Rogov, Ivan V. Oseledets

IJCAI 2024 pp. 4778-4787

doi:10.24963/ijcai.2024/528 /ijcai/2024/pautov2024ijcai-probabilistically/

Abstract

Backdoor attacks in vertical federated learning (VFL) are particularly concerning as they can covertly compromise VFL decision-making, posing a severe threat to critical applications of VFL. Existing defense mechanisms typically involve either label obfuscation during training or model pruning during inference. However, the inherent limitations on the defender's access to the global model and complete training data in VFL environments fundamentally constrain the effectiveness of these conventional methods. To address these limitations, we propose the Universal Backdoor Defense (UBD) framework. UBD leverages Label Consistent Clustering (LCC) to synthesize plausible latent triggers associated with the backdoor class. This synthesized information is then utilized for mitigating backdoor threats through Linear Probing (LP), guided by a constraint on Batch Normalization (BN) statistics. Positioned within a unified VFL backdoor defense paradigm, UBD offers a generalized framework for both detection and mitigation that critically does not necessitate access to the entire model or dataset. Extensive experiments across multiple datasets rigorously demonstrate the efficacy of the UBD framework, achieving state-of-the-art performance against diverse backdoor attack types in VFL, including both dirty-label and clean-label variants.

PDF IJCAI Semantic Scholar

Cite

Text

Pautov et al. "Probabilistically Robust Watermarking of Neural Networks." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/528

Markdown

[Pautov et al. "Probabilistically Robust Watermarking of Neural Networks." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/pautov2024ijcai-probabilistically/) doi:10.24963/ijcai.2024/528

BibTeX

@inproceedings{pautov2024ijcai-probabilistically,
  title     = {{Probabilistically Robust Watermarking of Neural Networks}},
  author    = {Pautov, Mikhail and Bogdanov, Nikita and Pyatkin, Stanislav and Rogov, Oleg and Oseledets, Ivan V.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4778-4787},
  doi       = {10.24963/ijcai.2024/528},
  url       = {https://mlanthology.org/ijcai/2024/pautov2024ijcai-probabilistically/}
}