Cyclic Equilibria in Markov Games

Abstract

Although variants of value iteration have been proposed for finding Nash or correlated equilibria in general-sum Markov games, these variants have not been shown to be effective in general. In this paper, we demon- strate by construction that existing variants of value iteration cannot find stationary equilibrium policies in arbitrary general-sum Markov games. Instead, we propose an alternative interpretation of the output of value it- eration based on a new (non-stationary) equilibrium concept that we call “cyclic equilibria.” We prove that value iteration identifies cyclic equi- libria in a class of games in which it fails to find stationary equilibria. We also demonstrate empirically that value iteration finds cyclic equilibria in nearly all examples drawn from a random distribution of Markov games.

Cite

Text

Zinkevich et al. "Cyclic Equilibria in Markov Games." Neural Information Processing Systems, 2005.

Markdown

[Zinkevich et al. "Cyclic Equilibria in Markov Games." Neural Information Processing Systems, 2005.](https://mlanthology.org/neurips/2005/zinkevich2005neurips-cyclic/)

BibTeX

@inproceedings{zinkevich2005neurips-cyclic,
  title     = {{Cyclic Equilibria in Markov Games}},
  author    = {Zinkevich, Martin and Greenwald, Amy and Littman, Michael L.},
  booktitle = {Neural Information Processing Systems},
  year      = {2005},
  pages     = {1641-1648},
  url       = {https://mlanthology.org/neurips/2005/zinkevich2005neurips-cyclic/}
}