Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)
Abstract
Using reinforcement learning [4] (RL), agents can autonomously learn a control policy to master sequential-decision tasks. Rather than always learning tabula rasa , our recent work [5,7,8] considers how an experienced RL agent, the teacher , can help another RL agent, the student , to learn. As a motivating example, consider a household robot that has learned to perform tasks in a household. When the consumer purchases a new robot, she would like the student robot to quickly learn to perform the same tasks as the teacher robot, even if the new robot has different state representation, learning method, or manufacturer. Our goals are to: 1) Allow the student to learn faster with the teacher than without it, 2) Allow the student and teacher to have different learning methods and knowledge representations, 3) Not limit the student’s performance when the teacher is sub-optimal, 4) Not require a complex, shared language, and 5) Limit the amount of communication required between the agents.
Cite
Text
Taylor and Torrey. "Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014. doi:10.1007/978-3-662-44845-8_50Markdown
[Taylor and Torrey. "Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014.](https://mlanthology.org/ecmlpkdd/2014/taylor2014ecmlpkdd-agents/) doi:10.1007/978-3-662-44845-8_50BibTeX
@inproceedings{taylor2014ecmlpkdd-agents,
title = {{Agents Teaching Agents in Reinforcement Learning (Nectar Abstract)}},
author = {Taylor, Matthew E. and Torrey, Lisa},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2014},
pages = {524-528},
doi = {10.1007/978-3-662-44845-8_50},
url = {https://mlanthology.org/ecmlpkdd/2014/taylor2014ecmlpkdd-agents/}
}