Robust Audio Adversarial Example for a Physical Attack

Abstract

We propose a method to generate audio adversarial examples that can attack a state-of-the-art speech recognition model in the physical world. Previous work assumes that generated adversarial examples are directly fed to the recognition model, and is not able to perform such a physical attack because of reverberation and noise from playback environments. In contrast, our method obtains robust adversarial examples by simulating transformations caused by playback or recording in the physical world and incorporating the transformations into the generation process. Evaluation and a listening experiment demonstrated that our adversarial examples are able to attack without being noticed by humans. This result suggests that audio adversarial examples generated by the proposed method may become a real threat.

Cite

Text

Yakura and Sakuma. "Robust Audio Adversarial Example for a Physical Attack." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/741

Markdown

[Yakura and Sakuma. "Robust Audio Adversarial Example for a Physical Attack." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/yakura2019ijcai-robust/) doi:10.24963/IJCAI.2019/741

BibTeX

@inproceedings{yakura2019ijcai-robust,
  title     = {{Robust Audio Adversarial Example for a Physical Attack}},
  author    = {Yakura, Hiromu and Sakuma, Jun},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {5334-5341},
  doi       = {10.24963/IJCAI.2019/741},
  url       = {https://mlanthology.org/ijcai/2019/yakura2019ijcai-robust/}
}