Augmenting Markov Decision Processes with Advising

Abstract

This paper introduces Advice-MDPs, an extension of Markov Decision Processes for generating policies that take into account advice on the desirability, undesirability, and prohibition of certain states and actions. Advice-MDPs enable the design of semi-autonomous systems (systems that require operator support to handle at least certain situations) that can efficiently handle unexpected complex environments. Through advising, operators can augment the planning model to cover unexpected real-world irregularities. This advising can swiftly increase the system's degree of autonomy, so that it can work without subsequent human intervention. This paper details the Advice-MDP formalism, a fast Advice-MDP resolution algorithm, and its applicability to real-world tasks, via the design of a professional-class semi-autonomous robot system ready to be deployed in a wide range of unexpected environments and capable of efficiently integrating operator advising.

Cite

Text

Vanhée et al. "Augmenting Markov Decision Processes with Advising." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33012531

Markdown

[Vanhée et al. "Augmenting Markov Decision Processes with Advising." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/vanhee2019aaai-augmenting/) doi:10.1609/AAAI.V33I01.33012531

BibTeX

@inproceedings{vanhee2019aaai-augmenting,
  title     = {{Augmenting Markov Decision Processes with Advising}},
  author    = {Vanhée, Loïs and Jeanpierre, Laurent and Mouaddib, Abdel-Illah},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {2531--2538},
  doi       = {10.1609/AAAI.V33I01.33012531},
  url       = {https://mlanthology.org/aaai/2019/vanhee2019aaai-augmenting/}
}