Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning
Abstract
When learning to act in a stochastic, partially observable environment, an intelligent agent should be prepared to anticipate a change in its belief of the environment state, and be capable of adapting its actions on-the-fly to changing conditions. As humans, we are able to form contingency plans when learning a task with the explicit aim of being able to correct errors in the initial control, and hence prove useful if ever there is a sudden change in our perception of the environment which requires immediate corrective action. This is especially the case for autonomous vehicles (AVs) navigating real-world situations where safety is paramount, and a strong ability to react to a changing belief about the environment is truly needed. In this paper we explore an end-to-end approach, from training to execution, for learning robust contingency plans and combining them with a hierarchical planner to obtain a robust agent policy in an autonomous navigation task where other vehicles’ behaviours are unknown, and the agent’s belief about these behaviours is subject to sudden, last-second change. We show that our approach results in robust, safe behaviour in a partially observable, stochastic environment, generalizing well over environment dynamics not seen during training.
Cite
Text
Lecerf et al. "Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning." ICLR 2022 Workshops: GPL, 2022.Markdown
[Lecerf et al. "Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning." ICLR 2022 Workshops: GPL, 2022.](https://mlanthology.org/iclrw/2022/lecerf2022iclrw-safer/)BibTeX
@inproceedings{lecerf2022iclrw-safer,
title = {{Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning}},
author = {Lecerf, Ugo and Yemdji-Tchassi, Christelle and Michiardi, Pietro},
booktitle = {ICLR 2022 Workshops: GPL},
year = {2022},
url = {https://mlanthology.org/iclrw/2022/lecerf2022iclrw-safer/}
}