Learning Against Opponents with Bounded Memory

Abstract

Recently, a number of authors have proposed criteria for evaluating learning algorithms in multiagent systems. While well-justified, each of these has generally given little attention to one of the main challenges of a multi-agent setting: the capability of the other agents to adapt and learn as well. We propose extending existing criteria to apply to a class of adaptive opponents with bounded memory. We then show an algorithm that provably achieves an ǫ-best response against this richer class of opponents while simultaneously guaranteeing a minimum payoff against any opponent and performing well in self-play. This new algorithm also demonstrates strong performance in empirical tests against a variety of opponents in a wide range of environments. 1

Cite

Text

Powers and Shoham. "Learning Against Opponents with Bounded Memory." International Joint Conference on Artificial Intelligence, 2005.

Markdown

[Powers and Shoham. "Learning Against Opponents with Bounded Memory." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/powers2005ijcai-learning/)

BibTeX

@inproceedings{powers2005ijcai-learning,
  title     = {{Learning Against Opponents with Bounded Memory}},
  author    = {Powers, Rob and Shoham, Yoav},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {817-822},
  url       = {https://mlanthology.org/ijcai/2005/powers2005ijcai-learning/}
}