Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation
Abstract
Applying Reinforcement Learning (RL) to real-world agents remains challenging because such agents must satisfy safety constraints. Prior work on Constrained Reinforcement Learning (CRL) has predominantly focused on on-policy algorithms. Although these algorithms are effective to a degree, their interaction efficiency in real-world settings is sub-optimal, which motivates more efficient off-policy methods. However, off-policy CRL algorithms struggle to estimate the C-function precisely, particularly because the Lagrange multiplier associated with the constraint fluctuates. To address this gap, our study examines C-value estimation in off-policy CRL and introduces the Adaptive Ensemble C-learning (AEC) approach to reduce these inaccuracies. Building on state-of-the-art off-policy algorithms, we propose AEC-based CRL algorithms designed for enhanced task optimization. Extensive experiments on nine constrained robotics tasks show that our algorithms achieve better interaction efficiency and performance than preceding methods.
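The abstract does not spell out how AEC combines its ensemble or tunes it, so the following is only a generic sketch of the setting it describes, not the authors' method. It assumes a Lagrangian formulation in which an ensemble of cost critics is aggregated as mean plus `k` standard deviations, and the multiplier is updated by projected gradient ascent. The function names, the coefficient `k`, the cost limit, and the learning rate are all hypothetical.

```python
from statistics import mean, pstdev


def ensemble_cost_estimate(c_values, k):
    """Aggregate ensemble C-value predictions (assumed rule: mean + k * std).

    A larger k gives a more conservative (higher) cost estimate.
    """
    return mean(c_values) + k * pstdev(c_values)


def update_multiplier(lam, cost_estimate, cost_limit, lr):
    """One projected gradient-ascent step on the Lagrange multiplier.

    The multiplier grows when the estimated cost exceeds the limit and
    shrinks otherwise, but never drops below zero.
    """
    return max(0.0, lam + lr * (cost_estimate - cost_limit))


def lagrangian_objective(q_value, cost_estimate, lam):
    """Policy objective under the Lagrangian relaxation: reward minus weighted cost."""
    return q_value - lam * cost_estimate


if __name__ == "__main__":
    # Hypothetical predictions from two cost critics for one state-action pair.
    c_hat = ensemble_cost_estimate([1.0, 3.0], k=0.5)
    lam = update_multiplier(0.0, c_hat, cost_limit=2.0, lr=0.1)
    print(c_hat, lam, lagrangian_objective(10.0, c_hat, lam))
```

Because the multiplier update depends directly on the cost estimate, any error in that estimate feeds into the multiplier and then back into the policy objective, which is the coupling the abstract identifies as a source of difficulty.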
Cite
Text
Zhang et al. "Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I19.30177
Markdown
[Zhang et al. "Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zhang2024aaai-enhancing-a/) doi:10.1609/AAAI.V38I19.30177
BibTeX
@inproceedings{zhang2024aaai-enhancing-a,
title = {{Enhancing Off-Policy Constrained Reinforcement Learning Through Adaptive Ensemble C Estimation}},
author = {Zhang, Hengrui and Lin, Youfang and Shen, Shuo and Han, Sheng and Lv, Kai},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {21770-21778},
doi = {10.1609/AAAI.V38I19.30177},
url = {https://mlanthology.org/aaai/2024/zhang2024aaai-enhancing-a/}
}