Dynamic Programming for Partially Observable Stochastic Games

Hansen, Eric A.; Bernstein, Daniel S.; Zilberstein, Shlomo

Dynamic Programming for Partially Observable Stochastic Games

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilberstein

AAAI 2004 pp. 709-715

/aaai/2004/hansen2004aaai-dynamic/

Abstract

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterated elimination of dominated strategies in normal form games. We prove that when applied to finite-horizon POSGs, the algorithm iteratively eliminates very weakly dominated strategies without first forming a normal form representation of the game. For the special case in which agents share the same payoffs, the algorithm can be used to find an optimal solution. We present preliminary empirical results and discuss ways to further exploit POMDP theory in solving POSGs. 1.

PDF AAAI Semantic Scholar

Cite

Text

Hansen et al. "Dynamic Programming for Partially Observable Stochastic Games." AAAI Conference on Artificial Intelligence, 2004.

Markdown

[Hansen et al. "Dynamic Programming for Partially Observable Stochastic Games." AAAI Conference on Artificial Intelligence, 2004.](https://mlanthology.org/aaai/2004/hansen2004aaai-dynamic/)

BibTeX

@inproceedings{hansen2004aaai-dynamic,
  title     = {{Dynamic Programming for Partially Observable Stochastic Games}},
  author    = {Hansen, Eric A. and Bernstein, Daniel S. and Zilberstein, Shlomo},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2004},
  pages     = {709-715},
  url       = {https://mlanthology.org/aaai/2004/hansen2004aaai-dynamic/}
}