Implicit Imitation in Multiagent Reinforcement Learning
Abstract
Imitation is actively being studied as an effective means of learning in multi-agent environments. It allows an agent to learn how to act well (perhaps optimally) by observing the actions of cooperative teachers or more experienced agents. We propose a straightforward imitation mechanism called model extraction that can be integrated easily into standard model-based reinforcement learning algorithms. Roughly, by observing a mentor with similar capabilities, an agent can extract information about its own capabilities in unvisited parts of state space. The extracted information can accelerate learning dramatically. We illustrate the benefits of model extraction by integrating it with prioritized sweeping, and demonstrating improved performance and convergence through observation of single and multiple mentors. Though we make some stringent assumptions regarding observability, possible interactions and common abilities, we briefly comment on extensions to the model that relax these. Key...
Cite
Text
Price and Boutilier. "Implicit Imitation in Multiagent Reinforcement Learning." International Conference on Machine Learning, 1999.Markdown
[Price and Boutilier. "Implicit Imitation in Multiagent Reinforcement Learning." International Conference on Machine Learning, 1999.](https://mlanthology.org/icml/1999/price1999icml-implicit/)BibTeX
@inproceedings{price1999icml-implicit,
title = {{Implicit Imitation in Multiagent Reinforcement Learning}},
author = {Price, Bob and Boutilier, Craig},
booktitle = {International Conference on Machine Learning},
year = {1999},
pages = {325-334},
url = {https://mlanthology.org/icml/1999/price1999icml-implicit/}
}