Bayesian Nonparametrics Meets Data-Driven Distributionally Robust Optimization

Abstract

Training machine learning and statistical models often involves optimizing a data-driven risk criterion. The risk is usually computed with respect to the empirical data distribution, but this may result in poor and unstable out-of-sample performance due to distributional uncertainty. In the spirit of distributionally robust optimization, we propose a novel robust criterion by combining insights from Bayesian nonparametric (i.e., Dirichlet process) theory and a recent decision-theoretic model of smooth ambiguity-averse preferences. First, we highlight novel connections with standard regularized empirical risk minimization techniques, among which Ridge and LASSO regressions. Then, we theoretically demonstrate the existence of favorable finite-sample and asymptotic statistical guarantees on the performance of the robust optimization procedure. For practical implementation, we propose and study tractable approximations of the criterion based on well-known Dirichlet process representations. We also show that the smoothness of the criterion naturally leads to standard gradient-based numerical optimization. Finally, we provide insights into the workings of our method by applying it to a variety of tasks based on simulated and real datasets.

Cite

Text

Bariletto and Ho. "Bayesian Nonparametrics Meets Data-Driven Distributionally Robust Optimization." Neural Information Processing Systems, 2024. doi:10.52202/079017-1149

Markdown

[Bariletto and Ho. "Bayesian Nonparametrics Meets Data-Driven Distributionally Robust Optimization." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/bariletto2024neurips-bayesian/) doi:10.52202/079017-1149

BibTeX

@inproceedings{bariletto2024neurips-bayesian,
  title     = {{Bayesian Nonparametrics Meets Data-Driven Distributionally Robust Optimization}},
  author    = {Bariletto, Nicola and Ho, Nhat},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-1149},
  url       = {https://mlanthology.org/neurips/2024/bariletto2024neurips-bayesian/}
}