Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents
Abstract
Training Reinforcement Learning agents directly in any real-world environment remains difficult, as such scenarios entail the risk of damaging the training setup or violating other safety constraints. The training process itself further requires extensive human supervision and intervention to reset the environment after each episode. Thus, we propose an innovative Safe Reinforcement Learning framework that combines Safe and Resetless RL to autonomously reset environments, while also reducing the number of safety constraint violations. In this context, we develop a novel risk-averse RL agent suitable for stringent safety constraints by combining Safe RL, Distributional RL, and Randomized Ensembled Double Q-Learning. Experiments conducted in a novel mobile robotics scenario indicate that our Safe Resetless RL framework reduces the number of human interactions required during training compared to state-of-the-art methods, filling a gap in current problem formulations and enhancing the autonomy of RL training processes in real-world settings.
Cite
Text
Gottwald et al. "Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-92591-7_7Markdown
[Gottwald et al. "Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/gottwald2024eccvw-safe/) doi:10.1007/978-3-031-92591-7_7BibTeX
@inproceedings{gottwald2024eccvw-safe,
title = {{Safe Resetless Reinforcement Learning: Enhancing Training Autonomy with Risk-Averse Agents}},
author = {Gottwald, Tristan and Schier, Maximilian and Rosenhahn, Bodo},
booktitle = {European Conference on Computer Vision Workshops},
year = {2024},
pages = {100-116},
doi = {10.1007/978-3-031-92591-7_7},
url = {https://mlanthology.org/eccvw/2024/gottwald2024eccvw-safe/}
}