Active Advice Seeking for Inverse Reinforcement Learning

Abstract

Intelligent systems that interact with humans typically require demonstrations and/or advice from a human expert for optimal decision making. While the active learning formalism allows these systems to incrementally acquire demonstrations from the human expert, most learning systems require all the advice about the domain in advance. We consider the problem of actively soliciting human advice in an inverse reinforcement learning setting where the utilities are learned from demonstrations. Our hypothesis is that such solicitation of advice reduces the burden on the human to provide advice about every scenario in advance.

Cite

Text

Odom and Natarajan. "Active Advice Seeking for Inverse Reinforcement Learning." AAAI Conference on Artificial Intelligence, 2015. doi:10.1609/AAAI.V29I1.9722

Markdown

[Odom and Natarajan. "Active Advice Seeking for Inverse Reinforcement Learning." AAAI Conference on Artificial Intelligence, 2015.](https://mlanthology.org/aaai/2015/odom2015aaai-active/) doi:10.1609/AAAI.V29I1.9722

BibTeX

@inproceedings{odom2015aaai-active,
  title     = {{Active Advice Seeking for Inverse Reinforcement Learning}},
  author    = {Odom, Phillip and Natarajan, Sriraam},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2015},
  pages     = {4186--4187},
  doi       = {10.1609/AAAI.V29I1.9722},
  url       = {https://mlanthology.org/aaai/2015/odom2015aaai-active/}
}