InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion
Abstract
Cloud service providers, including Google, Amazon, and Alibaba, have now launched machine-learning-as-a-service (MLaaS) platforms, allowing clients to access sophisticated cloud-based machine learning models via APIs. Unfortunately, however, the commercial value of these models makes them alluring targets for theft, and their strategic position as part of the IT infrastructure of many companies makes them an enticing springboard for conducting further adversarial attacks. In this paper, we put forth a novel and effective attack strategy, dubbed InverseNet, that steals the functionality of black-box cloud-based models with only a small number of queries. The crux of the innovation is that, unlike existing model extraction attacks that rely on public datasets or adversarial samples, InverseNet constructs inversed training samples to increase the similarity between the extracted substitute model and the victim model. Further, only a small number of data samples with high confidence scores (rather than an entire dataset) are used to reconstruct the inversed dataset, which substantially reduces the attack cost. Extensive experiments conducted on three simulated victim models and Alibaba Cloud's commercially-available API demonstrate that InverseNet yields a model with significantly greater functional similarity to the victim model than the current state-of-the-art attacks at a substantially lower query budget.
Cite
Text
Gong et al. "InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/336Markdown
[Gong et al. "InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/gong2021ijcai-inversenet/) doi:10.24963/IJCAI.2021/336BibTeX
@inproceedings{gong2021ijcai-inversenet,
title = {{InverseNet: Augmenting Model Extraction Attacks with Training Data Inversion}},
author = {Gong, Xueluan and Chen, Yanjiao and Yang, Wenbin and Mei, Guanghao and Wang, Qian},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2021},
pages = {2439-2447},
doi = {10.24963/IJCAI.2021/336},
url = {https://mlanthology.org/ijcai/2021/gong2021ijcai-inversenet/}
}