Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty

Abstract

Robust reinforcement learning (RL) aims to find a policy that optimizes the worst-case performance in the face of uncertainties. In this paper, we focus on action robust RL with the probabilistic policy execution uncertainty, in which, instead of always carrying out the action specified by the policy, the agent will take the action specified by the policy with probability $1-\rho$ and an alternative adversarial action with probability $\rho$. We show the existence of an optimal policy on the action robust MDPs with probabilistic policy execution uncertainty and provide the action robust Bellman optimality equation for its solution. Based on that, we develop Action Robust Reinforcement Learning with Certificates (ARRLC) algorithm that achieves minimax optimal regret and sample complexity. Our results highlight that action-robust RL shares the same sample complexity barriers as standard RL, ensuring robust performance without additional complexity costs. Furthermore, we conduct numerical experiments to validate our approach's robustness, demonstrating that ARRLC outperforms non-robust RL algorithms and converges faster than the other action robust RL algorithms in the presence of action perturbations.

Cite

Text

Liu et al. "Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty." Transactions on Machine Learning Research, 2024.

Markdown

[Liu et al. "Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/liu2024tmlr-efficient/)

BibTeX

@article{liu2024tmlr-efficient,
  title     = {{Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty}},
  author    = {Liu, Guanlin and Zhou, Zhihan and Liu, Han and Lai, Lifeng},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/liu2024tmlr-efficient/}
}