Generalized Point Based Value Iteration for Interactive POMDPs
Abstract
We develop a point based method for solving finitely nested interactive POMDPs approximately. Analogously to point based value iteration (PBVI) in POMDPs, we maintain a set of belief points and form value functions composed of those value vectors that are optimal at these points. However, as we focus on multiagent settings, the beliefs are nested and computation of the value vectors relies on predicted actions of others. Consequently, we develop a novel interactive generalization of PBVI applicable to multiagent settings.
Cite
Text
Doshi and Perez. "Generalized Point Based Value Iteration for Interactive POMDPs." AAAI Conference on Artificial Intelligence, 2008.Markdown
[Doshi and Perez. "Generalized Point Based Value Iteration for Interactive POMDPs." AAAI Conference on Artificial Intelligence, 2008.](https://mlanthology.org/aaai/2008/doshi2008aaai-generalized/)BibTeX
@inproceedings{doshi2008aaai-generalized,
title = {{Generalized Point Based Value Iteration for Interactive POMDPs}},
author = {Doshi, Prashant and Perez, Dennis},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2008},
pages = {63-68},
url = {https://mlanthology.org/aaai/2008/doshi2008aaai-generalized/}
}