Multi-Bit, Black-Box Watermarking of Deep Neural Networks in Embedded Applications
Abstract
The effort required to collect data and train a large neural network requires a significant investment from organizations. Therefore, trained neural networks are often seen as valuable intellectual property that needs to be protected. At the same time, we are increasingly seeing applications where a model is deployed on an edge device. This has several benefits, including improved privacy and reduced latency but it also opens up the possibility for third parties to extract the trained model from the device and to use it for their own purposes. Watermarking techniques aim to safeguard neural networks from unauthorized usage. These methods alter the model’s behavior for specific trigger inputs, enabling the owner to recognize stolen instances. However, existing watermarking algorithms are not suited for distributed edge AI scenarios as they only create a single watermarked model instance. We introduce a novel multi-bit watermarking approach capable of efficiently generating a large number of model instances. Each of these instances maintains functional equivalence but exhibits unique behaviors when prompted with a predefined trigger input. This allows the owner to trace the source of a model leak to a potentially malicious user. We experimentally validate our approach on the MNIST, CIFAR-10, and ImageNet datasets, evaluating model performance and resilience against watermark removal attacks.
Cite
Text
Leroux et al. "Multi-Bit, Black-Box Watermarking of Deep Neural Networks in Embedded Applications." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00217Markdown
[Leroux et al. "Multi-Bit, Black-Box Watermarking of Deep Neural Networks in Embedded Applications." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/leroux2024cvprw-multibit/) doi:10.1109/CVPRW63382.2024.00217BibTeX
@inproceedings{leroux2024cvprw-multibit,
title = {{Multi-Bit, Black-Box Watermarking of Deep Neural Networks in Embedded Applications}},
author = {Leroux, Sam and Vanassche, Stijn and Simoens, Pieter},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2024},
pages = {2121-2130},
doi = {10.1109/CVPRW63382.2024.00217},
url = {https://mlanthology.org/cvprw/2024/leroux2024cvprw-multibit/}
}