Attention-Imperceptible Backdoor Attacks on Vision Transformers
Abstract
With the successful transition of Transformers from natural language processing (NLP) to computer vision (CV), Vision Transformers (ViTs) have achieved state-of-the-art performance on many CV tasks. However, backdoor attacks, a significant threat in deep learning, also pose a risk to the security of ViT models. Recently, several backdoor attack methods targeting the patch-level self-attention mechanism in ViTs have been proposed, but they remain relatively naive in terms of stealthiness and robustness against defensive measures and lack in-depth investigation. In this paper, we explore the crucial role of attention-level imperceptibility in backdoor attacks on ViTs and propose Attention-Imperceptible Backdoor Attacks on Vision Transformers (AIBA). In AIBA, a constrained adversarial perturbation serves as the trigger to achieve visual imperceptibility. Additionally, the trigger is designed to be seamlessly implanted into the focal areas of the image, ensuring that it receives enough attention from the model without causing anomalies at the attention level. During backdoor learning, we design an efficient constrained bi-level optimization training strategy at the mini-batch level to implant an effective backdoor into the victim model using the imperceptible trigger. We evaluate the effectiveness of the proposed AIBA across multiple datasets and ViT benchmarks and explore its robustness against current ViT-specific defense methods. The experimental results demonstrate that our backdoor attack method can successfully implant a powerful and stealthy backdoor into ViTs.
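To make the training strategy described in the abstract concrete, below is a minimal PyTorch-style sketch, not the authors' implementation, of a mini-batch constrained bi-level optimization step: an L-infinity-bounded additive trigger is refined in an inner step toward an attacker-chosen target class, and the victim model is updated in an outer step on a mix of clean and triggered samples. All names (bilevel_backdoor_step, delta, target_class), the bound epsilon, the trigger step size, and the poison ratio are illustrative assumptions, and the attention-guided placement of the trigger into the image's focal areas is omitted here.

import torch
import torch.nn.functional as F

def bilevel_backdoor_step(model, optimizer, delta, images, labels,
                          target_class, epsilon=8 / 255, trigger_lr=0.01,
                          poison_ratio=0.1):
    # --- Inner step: refine the constrained trigger on this mini-batch ---
    delta.requires_grad_(True)
    triggered = torch.clamp(images + delta, 0.0, 1.0)
    target = torch.full_like(labels, target_class)
    inner_loss = F.cross_entropy(model(triggered), target)
    grad, = torch.autograd.grad(inner_loss, delta)
    with torch.no_grad():
        delta -= trigger_lr * grad.sign()   # gradient step on the trigger
        delta.clamp_(-epsilon, epsilon)     # keep the trigger imperceptible
    delta = delta.detach()

    # --- Outer step: train the victim model on clean + poisoned samples ---
    n_poison = max(1, int(poison_ratio * images.size(0)))
    poisoned = torch.clamp(images[:n_poison] + delta, 0.0, 1.0)
    mixed_x = torch.cat([poisoned, images[n_poison:]], dim=0)
    mixed_y = torch.cat([target[:n_poison], labels[n_poison:]], dim=0)
    outer_loss = F.cross_entropy(model(mixed_x), mixed_y)
    optimizer.zero_grad()
    outer_loss.backward()
    optimizer.step()
    return delta, outer_loss.item()

In this hypothetical setup, the caller would initialize delta as a zero tensor of the input shape and feed the returned delta back in on the next mini-batch, so the trigger and the backdoored model are optimized in alternation.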
Cite
Text
Wang et al. "Attention-Imperceptible Backdoor Attacks on Vision Transformers." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I8.32889
Markdown
[Wang et al. "Attention-Imperceptible Backdoor Attacks on Vision Transformers." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/wang2025aaai-attention/) doi:10.1609/AAAI.V39I8.32889
BibTeX
@inproceedings{wang2025aaai-attention,
title = {{Attention-Imperceptible Backdoor Attacks on Vision Transformers}},
author = {Wang, Zhishen and Wang, Rui and Jing, Lihua},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {8241-8249},
doi = {10.1609/AAAI.V39I8.32889},
url = {https://mlanthology.org/aaai/2025/wang2025aaai-attention/}
}