Non-Rectangular Robust MDPs with Normed Uncertainty Sets

Abstract

Robust policy evaluation for non-rectangular uncertainty set is generally NP-hard, even in approximation. Consequently, existing approaches suffer from either exponential iteration complexity or significant accuracy gaps. Interestingly, we identify a powerful class of $L_p$-bounded uncertainty sets that avoid these complexity barriers due to their structural simplicity. We further show that this class can be decomposed into infinitely many \texttt{sa}-rectangular $L_p$-bounded sets and leverage its structural properties to derive a novel dual formulation for $L_p$ robust Markov Decision Processes (MDPs). This formulation reveals key insights into the adversary’s strategy and leads to the \textbf{first polynomial-time robust policy evaluation algorithm} for $L_1$-normed non-rectangular robust MDPs.

Cite

Text

Kumar et al. "Non-Rectangular Robust MDPs with Normed  Uncertainty Sets." Advances in Neural Information Processing Systems, 2025.

Markdown

[Kumar et al. "Non-Rectangular Robust MDPs with Normed  Uncertainty Sets." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/kumar2025neurips-nonrectangular/)

BibTeX

@inproceedings{kumar2025neurips-nonrectangular,
  title     = {{Non-Rectangular Robust MDPs with Normed  Uncertainty Sets}},
  author    = {Kumar, Navdeep and Gupta, Adarsh and Elfatihi, Maxence Mohamed and Ramponi, Giorgia and Levy, Kfir Yehuda and Mannor, Shie},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/kumar2025neurips-nonrectangular/}
}