Noise Tolerance of Distributionally Robust Learning

Abstract

Given the importance of building robust machine learning models, considerable efforts have recently been put into developing training strategies that achieve robustness to outliers and adversarial attacks. Yet, a major aspect that remains an open problem is systematic robustness to global forms of noise such as those that come from measurements and quantization. Hence, we propose in this work an approach to train regression models from data with additive forms of noise, leveraging the Wasserstein distance as a loss function. Importantly, our approach is agnostic to the model structure, unlike the increasingly popular Wasserstein Distributionally Robust Learning paradigm (WDRL) which, we show, does not achieve improved robustness when the regression function is not convex or Lipschitz. We provide a theoretical analysis of the scaling of the regression functions in terms of the variance of the noise, for both formulations and show consistency of the proposed loss function. Lastly, we conclude with numerical experiments on physical PDE Benchmarks and electric grid data, demonstrating competitive performance with an order of magnitude reduction in computational cost.

Cite

Text

Dakhmouche et al. "Noise Tolerance of Distributionally Robust Learning." International Conference on Learning Representations, 2026.

Markdown

[Dakhmouche et al. "Noise Tolerance of Distributionally Robust Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/dakhmouche2026iclr-noise/)

BibTeX

@inproceedings{dakhmouche2026iclr-noise,
  title     = {{Noise Tolerance of Distributionally Robust Learning}},
  author    = {Dakhmouche, Ramzi and Lunati, Ivan and Gorji, Hossein},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/dakhmouche2026iclr-noise/}
}