Probabilistic Safeguard for Reinforcement Learning Using Safety Index Guided Gaussian Process Models
Abstract
Safety is one of the biggest concerns in applying reinforcement learning (RL) to the physical world. At its core, the challenge is to ensure that RL agents persistently satisfy a hard state constraint without access to white-box or black-box dynamics models. This paper presents an integrated model learning and safe control framework to safeguard any RL agent, where the environment dynamics are learned as Gaussian processes. The proposed theory provides (i) a novel method to construct an offline dataset for model learning that best achieves the safety requirements; (ii) a design rule for constructing the safety index so that a safe control always exists under control limits; (iii) a probabilistic safety guarantee (i.e., probabilistic forward invariance) when the model is learned from the aforementioned dataset. Simulation results show that our framework achieves almost zero safety violations on various continuous control tasks.
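To make the recipe in the abstract concrete, the sketch below illustrates the general pattern of Gaussian-process model learning combined with safety-index-guided action filtering on a toy double integrator. It is not the paper's algorithm or published code: the toy system, the safety index phi = p - p_max + k*v, the gains, the grid search over candidate actions, and the use of scikit-learn's GaussianProcessRegressor are all assumptions made only for illustration.

import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

# Toy double-integrator setup (all constants are illustrative assumptions).
dt, a_max = 0.1, 1.0   # time step and control limit
p_max, k = 1.0, 1.0    # position bound and safety-index gain: phi = p - p_max + k*v
eta, beta = 0.1, 2.0   # required decay of phi and GP confidence multiplier

def true_step(p, v, a):
    # Ground-truth dynamics, used only to generate an offline training set.
    return p + v * dt, v + a * dt

# Offline dataset: (velocity, action) -> one-step change in velocity.
rng = np.random.default_rng(0)
X = rng.uniform([-2.0, -a_max], [2.0, a_max], size=(200, 2))
y = np.array([true_step(0.0, v, a)[1] - v for v, a in X])

gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(1e-4), normalize_y=True)
gp.fit(X, y)

def safety_index(p, v):
    # phi >= 0 flags states that are unsafe or about to violate p <= p_max.
    return p - p_max + k * v

def safeguard(p, v, a_rl):
    # Return the action closest to the RL action whose predicted effect keeps
    # the safety index decreasing, using a conservative GP upper bound.
    if safety_index(p, v) < 0:
        return float(np.clip(a_rl, -a_max, a_max))
    candidates = np.linspace(-a_max, a_max, 41)  # grid search; a QP in general
    mean, std = gp.predict(
        np.column_stack([np.full_like(candidates, v), candidates]), return_std=True)
    dv_upper = mean + beta * std                 # pessimistic velocity change
    phi_next = safety_index(p + v * dt, v + dv_upper)
    feasible = candidates[phi_next <= safety_index(p, v) - eta * dt]
    if feasible.size == 0:                       # no feasible action: brake hard
        return -a_max
    return float(feasible[np.argmin(np.abs(feasible - a_rl))])

# The RL agent wants to accelerate toward the boundary; the safeguard decelerates.
print(safeguard(p=0.9, v=0.4, a_rl=0.8))

In the paper's framework, the safety index parameters and the offline dataset are constructed so that a safe action provably exists within the control limits with high probability; this sketch only mimics that structure with hand-picked constants.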
Cite
Text
Zhao et al. "Probabilistic Safeguard for Reinforcement Learning Using Safety Index Guided Gaussian Process Models." Proceedings of The 5th Annual Learning for Dynamics and Control Conference, 2023.Markdown
[Zhao et al. "Probabilistic Safeguard for Reinforcement Learning Using Safety Index Guided Gaussian Process Models." Proceedings of The 5th Annual Learning for Dynamics and Control Conference, 2023.](https://mlanthology.org/l4dc/2023/zhao2023l4dc-probabilistic/)BibTeX
@inproceedings{zhao2023l4dc-probabilistic,
title = {{Probabilistic Safeguard for Reinforcement Learning Using Safety Index Guided Gaussian Process Models}},
author = {Zhao, Weiye and He, Tairan and Liu, Changliu},
booktitle = {Proceedings of The 5th Annual Learning for Dynamics and Control Conference},
year = {2023},
pages = {783--796},
volume = {211},
url = {https://mlanthology.org/l4dc/2023/zhao2023l4dc-probabilistic/}
}