Contrasting Adversarial Perturbations: The Space of Harmless Perturbations

Abstract

Existing works have extensively studied adversarial examples, which are minimal perturbations that can mislead the output of deep neural networks (DNNs) while remaining imperceptible to humans. However, in this work, we reveal the existence of a harmless perturbation space, in which perturbations drawn from this space, regardless of their magnitudes, leave the network output unchanged when applied to inputs. Essentially, the harmless perturbation space emerges from the usage of non-injective functions (linear or non-linear layers) within DNNs, enabling multiple distinct inputs to be mapped to the same output. For linear layers with input dimensions exceeding output dimensions, any linear combination of the orthogonal bases of the nullspace of the parameter consistently yields no change in their output. For non-linear layers, the harmless perturbation space may expand, depending on the properties of the layers and input samples. Inspired by this property of DNNs, we solve for a family of general perturbation spaces that are redundant for the DNN's decision, and can be used to hide sensitive data and serve as a means of model identification. Our work highlights the distinctive robustness of DNNs (i.e., consistency under large magnitude perturbations) in contrast to adversarial examples (vulnerability for small noises).

Cite

Text

Chen et al. "Contrasting Adversarial Perturbations: The Space of Harmless Perturbations." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I2.32209

Markdown

[Chen et al. "Contrasting Adversarial Perturbations: The Space of Harmless Perturbations." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/chen2025aaai-contrasting/) doi:10.1609/AAAI.V39I2.32209

BibTeX

@inproceedings{chen2025aaai-contrasting,
  title     = {{Contrasting Adversarial Perturbations: The Space of Harmless Perturbations}},
  author    = {Chen, Lu and Li, Shaofeng and Huang, Benhao and Yang, Fan and Li, Zheng and Li, Jie and Luo, Yuan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {2114-2122},
  doi       = {10.1609/AAAI.V39I2.32209},
  url       = {https://mlanthology.org/aaai/2025/chen2025aaai-contrasting/}
}