Non-Rectangular Robust MDPs with Normed Uncertainty Sets
Abstract
Robust policy evaluation for non-rectangular uncertainty set is generally NP-hard, even in approximation. Consequently, existing approaches suffer from either exponential iteration complexity or significant accuracy gaps. Interestingly, we identify a powerful class of $L_p$-bounded uncertainty sets that avoid these complexity barriers due to their structural simplicity. We further show that this class can be decomposed into infinitely many \texttt{sa}-rectangular $L_p$-bounded sets and leverage its structural properties to derive a novel dual formulation for $L_p$ robust Markov Decision Processes (MDPs). This formulation reveals key insights into the adversary’s strategy and leads to the \textbf{first polynomial-time robust policy evaluation algorithm} for $L_1$-normed non-rectangular robust MDPs.
Cite
Text
Kumar et al. "Non-Rectangular Robust MDPs with Normed Uncertainty Sets." Advances in Neural Information Processing Systems, 2025.Markdown
[Kumar et al. "Non-Rectangular Robust MDPs with Normed Uncertainty Sets." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/kumar2025neurips-nonrectangular/)BibTeX
@inproceedings{kumar2025neurips-nonrectangular,
title = {{Non-Rectangular Robust MDPs with Normed Uncertainty Sets}},
author = {Kumar, Navdeep and Gupta, Adarsh and Elfatihi, Maxence Mohamed and Ramponi, Giorgia and Levy, Kfir Yehuda and Mannor, Shie},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/kumar2025neurips-nonrectangular/}
}