Augmenting Markov Decision Processes with Advising
Abstract
This paper introduces Advice-MDPs, an extension of Markov Decision Processes for generating policies that take into account advice on the desirability, undesirability, and prohibition of certain states and actions. Advice-MDPs enable the design of semi-autonomous systems (systems that require operator support to handle at least some situations) that can efficiently handle unexpected complex environments. Through advising, operators can augment the planning model to cover unexpected real-world irregularities. This advising can swiftly increase the degree of autonomy of the system, so that it can work without subsequent human intervention. This paper details the Advice-MDP formalism, a fast Advice-MDP resolution algorithm, and its applicability to real-world tasks, via the design of a professional-class semi-autonomous robot system ready to be deployed in a wide range of unexpected environments and capable of efficiently integrating operator advising.
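To make the idea of advising on states and actions more concrete, the following is a minimal illustrative sketch and not the paper's formalism or resolution algorithm. It assumes that "desirable" and "undesirable" advice can be approximated by a reward bonus or penalty on entering a state, and that "prohibited" advice removes actions from consideration; the toy MDP, the `solve` function, and all constants are hypothetical names introduced only for this example. Plain value iteration stands in for whatever solver the paper actually uses.

```python
from typing import Dict, FrozenSet, Tuple

# Hypothetical toy MDP (not from the paper): deterministic transitions,
# T[state][action] = next_state. State 4 is the goal and has no actions.
T: Dict[int, Dict[str, int]] = {
    0: {"short": 1, "long": 2},
    1: {"go": 4},
    2: {"go": 3},
    3: {"go": 4},
    4: {},
}
GOAL = 4
STEP_COST = -1.0   # assumed cost of every move
GAMMA = 0.95       # assumed discount factor
DEAD_END = -100.0  # assumed value of a non-goal state with no allowed action


def solve(
    desirable: FrozenSet[int] = frozenset(),
    undesirable: FrozenSet[int] = frozenset(),
    prohibited_states: FrozenSet[int] = frozenset(),
    prohibited_actions: FrozenSet[Tuple[int, str]] = frozenset(),
    bonus: float = 1.0,
    penalty: float = 2.0,
    sweeps: int = 20,
) -> Dict[int, str]:
    """Value iteration where operator advice reshapes rewards and masks actions."""
    values = {s: 0.0 for s in T}
    policy: Dict[int, str] = {}
    for _ in range(sweeps):
        for s, actions in T.items():
            q: Dict[str, float] = {}
            for a, s2 in actions.items():
                # Prohibition (assumption): drop the action entirely.
                if (s, a) in prohibited_actions or s2 in prohibited_states:
                    continue
                reward = STEP_COST
                # Desirability / undesirability (assumption): shape the reward.
                if s2 in desirable:
                    reward += bonus
                if s2 in undesirable:
                    reward -= penalty
                q[a] = reward + GAMMA * values[s2]
            if q:
                policy[s] = max(q, key=q.get)
                values[s] = q[policy[s]]
            else:
                values[s] = 0.0 if s == GOAL else DEAD_END
                policy.pop(s, None)
    return policy


if __name__ == "__main__":
    print(solve()[0])                                # policy at state 0 without advice
    print(solve(undesirable=frozenset({1}))[0])      # operator marks state 1 undesirable
    print(solve(prohibited_states=frozenset({1}))[0])  # operator prohibits state 1
```

In this sketch the policy is recomputed from the augmented model each time advice is given, so a single piece of advice (for example, prohibiting state 1) changes the route chosen at state 0 without any further operator input.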
Cite
Text
Vanhée et al. "Augmenting Markov Decision Processes with Advising." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33012531
Markdown
[Vanhée et al. "Augmenting Markov Decision Processes with Advising." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/vanhee2019aaai-augmenting/) doi:10.1609/AAAI.V33I01.33012531
BibTeX
@inproceedings{vanhee2019aaai-augmenting,
title = {{Augmenting Markov Decision Processes with Advising}},
author = {Vanhée, Loïs and Jeanpierre, Laurent and Mouaddib, Abdel-Illah},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2019},
pages = {2531-2538},
doi = {10.1609/AAAI.V33I01.33012531},
url = {https://mlanthology.org/aaai/2019/vanhee2019aaai-augmenting/}
}