Cyclic Equilibria in Markov Games
Abstract
Although variants of value iteration have been proposed for finding Nash or correlated equilibria in general-sum Markov games, these variants have not been shown to be effective in general. In this paper, we demon- strate by construction that existing variants of value iteration cannot find stationary equilibrium policies in arbitrary general-sum Markov games. Instead, we propose an alternative interpretation of the output of value it- eration based on a new (non-stationary) equilibrium concept that we call “cyclic equilibria.” We prove that value iteration identifies cyclic equi- libria in a class of games in which it fails to find stationary equilibria. We also demonstrate empirically that value iteration finds cyclic equilibria in nearly all examples drawn from a random distribution of Markov games.
Cite
Text
Zinkevich et al. "Cyclic Equilibria in Markov Games." Neural Information Processing Systems, 2005.Markdown
[Zinkevich et al. "Cyclic Equilibria in Markov Games." Neural Information Processing Systems, 2005.](https://mlanthology.org/neurips/2005/zinkevich2005neurips-cyclic/)BibTeX
@inproceedings{zinkevich2005neurips-cyclic,
title = {{Cyclic Equilibria in Markov Games}},
author = {Zinkevich, Martin and Greenwald, Amy and Littman, Michael L.},
booktitle = {Neural Information Processing Systems},
year = {2005},
pages = {1641-1648},
url = {https://mlanthology.org/neurips/2005/zinkevich2005neurips-cyclic/}
}