Probabilistically Robust Watermarking of Neural Networks
Abstract
Backdoor attacks in vertical federated learning (VFL) are particularly concerning as they can covertly compromise VFL decision-making, posing a severe threat to critical applications of VFL. Existing defense mechanisms typically involve either label obfuscation during training or model pruning during inference. However, the inherent limitations on the defender's access to the global model and complete training data in VFL environments fundamentally constrain the effectiveness of these conventional methods. To address these limitations, we propose the Universal Backdoor Defense (UBD) framework. UBD leverages Label Consistent Clustering (LCC) to synthesize plausible latent triggers associated with the backdoor class. This synthesized information is then utilized for mitigating backdoor threats through Linear Probing (LP), guided by a constraint on Batch Normalization (BN) statistics. Positioned within a unified VFL backdoor defense paradigm, UBD offers a generalized framework for both detection and mitigation that critically does not necessitate access to the entire model or dataset. Extensive experiments across multiple datasets rigorously demonstrate the efficacy of the UBD framework, achieving state-of-the-art performance against diverse backdoor attack types in VFL, including both dirty-label and clean-label variants.
Cite
Text
Pautov et al. "Probabilistically Robust Watermarking of Neural Networks." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/528Markdown
[Pautov et al. "Probabilistically Robust Watermarking of Neural Networks." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/pautov2024ijcai-probabilistically/) doi:10.24963/ijcai.2024/528BibTeX
@inproceedings{pautov2024ijcai-probabilistically,
title = {{Probabilistically Robust Watermarking of Neural Networks}},
author = {Pautov, Mikhail and Bogdanov, Nikita and Pyatkin, Stanislav and Rogov, Oleg and Oseledets, Ivan V.},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {4778-4787},
doi = {10.24963/ijcai.2024/528},
url = {https://mlanthology.org/ijcai/2024/pautov2024ijcai-probabilistically/}
}