Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation

Abstract

Applying Reinforcement Learning (RL) to real-world agents remains challenging because such agents must satisfy safety constraints. Prior work on Constrained Reinforcement Learning (CRL) has focused predominantly on on-policy algorithms. Although these algorithms are effective to a degree, their interaction efficiency in real-world settings is sub-optimal, which motivates more efficient off-policy methods. However, off-policy CRL algorithms struggle to estimate the C-function precisely, particularly because of fluctuations in the Lagrange multiplier associated with the constraint. To address this gap, our study examines C-value estimation in off-policy CRL and introduces the Adaptive Ensemble C-learning (AEC) approach to reduce these estimation inaccuracies. Building on state-of-the-art off-policy algorithms, we propose AEC-based CRL algorithms designed for enhanced task optimization. Extensive experiments on nine constrained robotics tasks show that our algorithms achieve superior interaction efficiency and performance compared with preceding methods.

Cite

Text

Zhang et al. "Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I19.30177

Markdown

[Zhang et al. "Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zhang2024aaai-enhancing-a/) doi:10.1609/AAAI.V38I19.30177

BibTeX

@inproceedings{zhang2024aaai-enhancing-a,
  title     = {{Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation}},
  author    = {Zhang, Hengrui and Lin, Youfang and Shen, Shuo and Han, Sheng and Lv, Kai},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {21770--21778},
  doi       = {10.1609/AAAI.V38I19.30177},
  url       = {https://mlanthology.org/aaai/2024/zhang2024aaai-enhancing-a/}
}