COCO-Q: Learning in Stochastic Games with Side Payments

Sodomka, Eric; Hilliard, Elizabeth; Littman, Michael; Greenwald, Amy

COCO-Q: Learning in Stochastic Games with Side Payments

Eric Sodomka, Elizabeth Hilliard, Michael Littman, Amy Greenwald

ICML 2013 pp. 1471-1479

/icml/2013/sodomka2013icml-cocoq/

Abstract

Coco (""cooperative/competitive"") values are a solution concept for two-player normal-form games with transferable utility, when binding agreements and side payments between players are possible. In this paper, we show that coco values can also be defined for stochastic games and can be learned using a simple variant of Q-learning that is provably convergent. We provide a set of examples showing how the strategies learned by the Coco-Q algorithm relate to those learned by existing multiagent Q-learning algorithms.

PDF ICML Semantic Scholar

Cite

Text

Sodomka et al. "COCO-Q: Learning in Stochastic Games with Side Payments." International Conference on Machine Learning, 2013.

Markdown

[Sodomka et al. "COCO-Q: Learning in Stochastic Games with Side Payments." International Conference on Machine Learning, 2013.](https://mlanthology.org/icml/2013/sodomka2013icml-cocoq/)

BibTeX

@inproceedings{sodomka2013icml-cocoq,
  title     = {{COCO-Q: Learning in Stochastic Games with Side Payments}},
  author    = {Sodomka, Eric and Hilliard, Elizabeth and Littman, Michael and Greenwald, Amy},
  booktitle = {International Conference on Machine Learning},
  year      = {2013},
  pages     = {1471-1479},
  volume    = {28},
  url       = {https://mlanthology.org/icml/2013/sodomka2013icml-cocoq/}
}