Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness

Abstract

Diffusion models (DMs) have demonstrated great potential in the field of adversarial robustness, where DM-based defense methods can achieve superior defense capability without adversarial training. However, they all require huge computational costs due to the usage of large-scale pre-trained DMs, making it difficult to conduct full evaluation under strong attacks and compare with traditional CNN-based methods. Simply reducing the network size and timesteps in DMs could significantly harm the image generation quality, which invalidates previous frameworks. To alleviate this issue, we redesign the diffusion framework from generating high-quality images to predicting distinguishable image labels. Specifically, we employ an image translation framework to learn many-to-one mapping from input samples to designed orthogonal image labels. Based on this framework, we introduce an efficient Image-to-Image diffusion classifier with a pruned U-Net structure and reduced diffusion timesteps. Besides the framework, we redesign the optimization objective of DMs to fit the target of image classification, where a new classification loss is incorporated in the DM-based image translation framework to distinguish the generated label from those of other classes. We conduct sufficient evaluations of the proposed classifier under various attacks on popular benchmarks. Extensive experiments show that our method achieves better adversarial robustness with fewer computational costs than DM-based and CNN-based methods.

Cite

Text

Mei et al. "Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I6.32650

Markdown

[Mei et al. "Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/mei2025aaai-efficient/) doi:10.1609/AAAI.V39I6.32650

BibTeX

@inproceedings{mei2025aaai-efficient,
  title     = {{Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness}},
  author    = {Mei, Hefei and Dong, Minjing and Xu, Chang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {6081-6089},
  doi       = {10.1609/AAAI.V39I6.32650},
  url       = {https://mlanthology.org/aaai/2025/mei2025aaai-efficient/}
}