Dynamic Teaching in Sequential Decision Making Environments

Walsh, Thomas J.; Goschin, Sergiu

Dynamic Teaching in Sequential Decision Making Environments

UAI 2012 pp. 863-872

/uai/2012/walsh2012uai-dynamic/

Abstract

We describe theoretical bounds and a practical algorithm for teaching a model by demonstration in a sequential decision making environment. Unlike previous efforts that have optimized learners that watch a teacher demonstrate a static policy, we focus on the teacher as a decision maker who can dynamically choose different policies to teach different parts of the environment. We develop several teaching frameworks based on previously defined supervised protocols, such as Teaching Dimension, extending them to handle noise and sequences of inputs encountered in an MDP.We provide theoretical bounds on the learnability of several important model classes in this setting and suggest a practical algorithm for dynamic teaching.

PDF UAI Semantic Scholar

Cite

Text

Walsh and Goschin. "Dynamic Teaching in Sequential Decision Making Environments." Conference on Uncertainty in Artificial Intelligence, 2012.

Markdown

[Walsh and Goschin. "Dynamic Teaching in Sequential Decision Making Environments." Conference on Uncertainty in Artificial Intelligence, 2012.](https://mlanthology.org/uai/2012/walsh2012uai-dynamic/)

BibTeX

@inproceedings{walsh2012uai-dynamic,
  title     = {{Dynamic Teaching in Sequential Decision Making Environments}},
  author    = {Walsh, Thomas J. and Goschin, Sergiu},
  booktitle = {Conference on Uncertainty in Artificial Intelligence},
  year      = {2012},
  pages     = {863-872},
  url       = {https://mlanthology.org/uai/2012/walsh2012uai-dynamic/}
}