Efficient Model Stealing Defense with Noise Transition Matrix

Abstract

With the escalating complexity and investment cost of training deep neural networks safeguarding them from unauthorized usage and intellectual property theft has become imperative. Especially the rampant misuse of prediction APIs to replicate models without access to the original data or architecture poses grave security threats. Diverse defense strategies have emerged to address these vulnerabilities yet these defenses either incur heavy inference overheads or assume idealized attack scenarios. To address these challenges we revisit the utilization of noise transition matrix as an efficient perturbation technique which injects noise into predicted posteriors in a linear manner and integrates seamlessly into existing systems with minimal overhead for model stealing defense. Provably with such perturbed posteriors the attacker's cloning process degrades into learning from noisy data. Toward optimizing the noise transition matrix we proposed a novel bi-level optimization training framework which performs fidelity on the victim model while the surrogate model adversarially. Comprehensive experimental results demonstrate that our method effectively thwarts model stealing attacks and achieves minimal utility trade-offs outperforming existing state-of-the-art defenses.

Cite

Text

Wu et al. "Efficient Model Stealing Defense with Noise Transition Matrix." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.02294

Markdown

[Wu et al. "Efficient Model Stealing Defense with Noise Transition Matrix." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/wu2024cvpr-efficient/) doi:10.1109/CVPR52733.2024.02294

BibTeX

@inproceedings{wu2024cvpr-efficient,
  title     = {{Efficient Model Stealing Defense with Noise Transition Matrix}},
  author    = {Wu, Dong-Dong and Fu, Chilin and Wu, Weichang and Xia, Wenwen and Zhang, Xiaolu and Zhou, Jun and Zhang, Min-Ling},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {24305-24315},
  doi       = {10.1109/CVPR52733.2024.02294},
  url       = {https://mlanthology.org/cvpr/2024/wu2024cvpr-efficient/}
}