Hard-Label Based Small Query Black-Box Adversarial Attack

Abstract

We consider the hard-label based black-box adversarial attack setting which solely observes the target model's predicted class. Most of the attack methods in this setting suffer from impractical number of queries required to achieve a successful attack. One approach to tackle this drawback is utilising the adversarial transferability between white-box surrogate models and black-box target model. However, the majority of the methods adopting this approach are soft-label based to take the full advantage of zeroth-order optimisation. Unlike mainstream methods, we propose a new practical setting of hard-label based attack with an optimisation process guided by a pre-trained surrogate model. Experiments show the proposed method significantly improves the query efficiency of the hard-label based black-box attack across various target model architectures. We find the proposed method achieves approximately 5 times higher attack success rate compared to the benchmarks, especially at the small query budgets as 100 and 250.

Cite

Text

Park et al. "Hard-Label Based Small Query Black-Box Adversarial Attack." Winter Conference on Applications of Computer Vision, 2024.

Markdown

[Park et al. "Hard-Label Based Small Query Black-Box Adversarial Attack." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/park2024wacv-hardlabel/)

BibTeX

@inproceedings{park2024wacv-hardlabel,
  title     = {{Hard-Label Based Small Query Black-Box Adversarial Attack}},
  author    = {Park, Jeonghwan and Miller, Paul and McLaughlin, Niall},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2024},
  pages     = {3986-3995},
  url       = {https://mlanthology.org/wacv/2024/park2024wacv-hardlabel/}
}