Learning Against Opponents with Bounded Memory
Abstract
Recently, a number of authors have proposed criteria for evaluating learning algorithms in multiagent systems. While well-justified, each of these has generally given little attention to one of the main challenges of a multi-agent setting: the capability of the other agents to adapt and learn as well. We propose extending existing criteria to apply to a class of adaptive opponents with bounded memory. We then show an algorithm that provably achieves an ǫ-best response against this richer class of opponents while simultaneously guaranteeing a minimum payoff against any opponent and performing well in self-play. This new algorithm also demonstrates strong performance in empirical tests against a variety of opponents in a wide range of environments. 1
Cite
Text
Powers and Shoham. "Learning Against Opponents with Bounded Memory." International Joint Conference on Artificial Intelligence, 2005.Markdown
[Powers and Shoham. "Learning Against Opponents with Bounded Memory." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/powers2005ijcai-learning/)BibTeX
@inproceedings{powers2005ijcai-learning,
title = {{Learning Against Opponents with Bounded Memory}},
author = {Powers, Rob and Shoham, Yoav},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2005},
pages = {817-822},
url = {https://mlanthology.org/ijcai/2005/powers2005ijcai-learning/}
}