InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion

Gong, Xueluan; Chen, Yanjiao; Yang, Wenbin; Mei, Guanghao; Wang, Qian

doi:10.24963/IJCAI.2021/336

InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion

Xueluan Gong, Yanjiao Chen, Wenbin Yang, Guanghao Mei, Qian Wang

IJCAI 2021 pp. 2439-2447

doi:10.24963/IJCAI.2021/336 /ijcai/2021/gong2021ijcai-inversenet/

Abstract

Cloud service providers, including Google, Amazon, and Alibaba, have now launched machine-learning-as-a-service (MLaaS) platforms, allowing clients to access sophisticated cloud-based machine learning models via APIs. Unfortunately, however, the commercial value of these models makes them alluring targets for theft, and their strategic position as part of the IT infrastructure of many companies makes them an enticing springboard for conducting further adversarial attacks. In this paper, we put forth a novel and effective attack strategy, dubbed InverseNet, that steals the functionality of black-box cloud-based models with only a small number of queries. The crux of the innovation is that, unlike existing model extraction attacks that rely on public datasets or adversarial samples, InverseNet constructs inversed training samples to increase the similarity between the extracted substitute model and the victim model. Further, only a small number of data samples with high confidence scores (rather than an entire dataset) are used to reconstruct the inversed dataset, which substantially reduces the attack cost. Extensive experiments conducted on three simulated victim models and Alibaba Cloud's commercially-available API demonstrate that InverseNet yields a model with significantly greater functional similarity to the victim model than the current state-of-the-art attacks at a substantially lower query budget.

PDF IJCAI Semantic Scholar

Cite

Text

Gong et al. "InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/336

Markdown

[Gong et al. "InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/gong2021ijcai-inversenet/) doi:10.24963/IJCAI.2021/336

BibTeX

@inproceedings{gong2021ijcai-inversenet,
  title     = {{InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion}},
  author    = {Gong, Xueluan and Chen, Yanjiao and Yang, Wenbin and Mei, Guanghao and Wang, Qian},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {2439-2447},
  doi       = {10.24963/IJCAI.2021/336},
  url       = {https://mlanthology.org/ijcai/2021/gong2021ijcai-inversenet/}
}