Why Adversarially Train Diffusion Models?

Briglia, Maria Rosaria; Mirza, Mujtaba Hussain; Lisanti, Giuseppe; Masi, Iacopo

Why Adversarially Train Diffusion Models?

Maria Rosaria Briglia, Mujtaba Hussain Mirza, Giuseppe Lisanti, Iacopo Masi

ICLR 2026

/iclr/2026/briglia2026iclr-adversarially/

Abstract

Adversarial Training (AT) is a known, powerful, well-established technique for improving classifier robustness to input perturbations, yet its applicability beyond discriminative settings remains limited. Motivated by the widespread use of score-based generative models and their need to operate robustly under substantial noisy or corrupted input data, we propose an adaptation of AT for these models, providing a thorough empirical assessment. We introduce a principled formulation of AT for Diffusion Models (DMs) that replaces the conventional *invariance* objective with an *equivariance* constraint aligned to the denoising dynamics of score matching. Our method integrates seamlessly into diffusion training by adding either random perturbations--similar to randomized smoothing--or adversarial ones--akin to AT. Our approach offers several advantages: **(a)** tolerance to heavy noise and corruption, **(b)** reduced memorization, **(c)** robustness to outliers and extreme data variability and **(d)** resilience to iterative adversarial attacks. We validate these claims on proof-of-concept low- and high-dimensional datasets with *known* ground-truth distributions, enabling precise error analysis. We further evaluate on standard benchmarks (CIFAR-10, CelebA, and LSUN Bedroom), where our approach shows improved robustness and preserved sample fidelity under severe noise, data corruption, and adversarial evaluation. Code available at [github.com/OmnAI-Lab/Adversarial-Training-DM](https://github.com/OmnAI-Lab/Adversarial-Training-DM)

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Briglia et al. "Why Adversarially Train Diffusion Models?." International Conference on Learning Representations, 2026.

Markdown

[Briglia et al. "Why Adversarially Train Diffusion Models?." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/briglia2026iclr-adversarially/)

BibTeX

@inproceedings{briglia2026iclr-adversarially,
  title     = {{Why Adversarially Train Diffusion Models?}},
  author    = {Briglia, Maria Rosaria and Mirza, Mujtaba Hussain and Lisanti, Giuseppe and Masi, Iacopo},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/briglia2026iclr-adversarially/}
}