Generalized Point Based Value Iteration for Interactive POMDPs

Doshi, Prashant; Perez, Dennis

Generalized Point Based Value Iteration for Interactive POMDPs

AAAI 2008 pp. 63-68

/aaai/2008/doshi2008aaai-generalized/

Abstract

We develop a point based method for solving finitely nested interactive POMDPs approximately. Analogously to point based value iteration (PBVI) in POMDPs, we maintain a set of belief points and form value functions composed of those value vectors that are optimal at these points. However, as we focus on multiagent settings, the beliefs are nested and computation of the value vectors relies on predicted actions of others. Consequently, we develop a novel interactive generalization of PBVI applicable to multiagent settings.

PDF AAAI Semantic Scholar

Cite

Text

Doshi and Perez. "Generalized Point Based Value Iteration for Interactive POMDPs." AAAI Conference on Artificial Intelligence, 2008.

Markdown

[Doshi and Perez. "Generalized Point Based Value Iteration for Interactive POMDPs." AAAI Conference on Artificial Intelligence, 2008.](https://mlanthology.org/aaai/2008/doshi2008aaai-generalized/)

BibTeX

@inproceedings{doshi2008aaai-generalized,
  title     = {{Generalized Point Based Value Iteration for Interactive POMDPs}},
  author    = {Doshi, Prashant and Perez, Dennis},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2008},
  pages     = {63-68},
  url       = {https://mlanthology.org/aaai/2008/doshi2008aaai-generalized/}
}