Hard-Label Based Small Query Black-Box Adversarial Attack
Abstract
We consider the hard-label based black-box adversarial attack setting which solely observes the target model's predicted class. Most of the attack methods in this setting suffer from impractical number of queries required to achieve a successful attack. One approach to tackle this drawback is utilising the adversarial transferability between white-box surrogate models and black-box target model. However, the majority of the methods adopting this approach are soft-label based to take the full advantage of zeroth-order optimisation. Unlike mainstream methods, we propose a new practical setting of hard-label based attack with an optimisation process guided by a pre-trained surrogate model. Experiments show the proposed method significantly improves the query efficiency of the hard-label based black-box attack across various target model architectures. We find the proposed method achieves approximately 5 times higher attack success rate compared to the benchmarks, especially at the small query budgets as 100 and 250.
Cite
Text
Park et al. "Hard-Label Based Small Query Black-Box Adversarial Attack." Winter Conference on Applications of Computer Vision, 2024.Markdown
[Park et al. "Hard-Label Based Small Query Black-Box Adversarial Attack." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/park2024wacv-hardlabel/)BibTeX
@inproceedings{park2024wacv-hardlabel,
title = {{Hard-Label Based Small Query Black-Box Adversarial Attack}},
author = {Park, Jeonghwan and Miller, Paul and McLaughlin, Niall},
booktitle = {Winter Conference on Applications of Computer Vision},
year = {2024},
pages = {3986-3995},
url = {https://mlanthology.org/wacv/2024/park2024wacv-hardlabel/}
}