Using Core Beliefs for Point-Based Value Iteration
Abstract
Recent research on point-based approximation algorithms for POMDPs demonstrated that good solutions to POMDP problems can be obtained without considering the entire belief simplex. For instance, the Point Based Value Iteration (PBVI) algorithm [Pineau et al., 2003] computes the value function only for a small set of belief states and iteratively adds more points to the set as needed. A key component of the algorithm is the strategy for selecting belief points, such that the space of reachable beliefs is well covered. This paper presents a new method for selecting an initial set of representative belief points, which relies on finding first the basis for the reachable belief simplex. Our approach has better worst-case performance than the original PBVI heuristic, and performs well in several standard POMDP tasks.
Cite
Text
Izadi et al. "Using Core Beliefs for Point-Based Value Iteration." International Joint Conference on Artificial Intelligence, 2005.Markdown
[Izadi et al. "Using Core Beliefs for Point-Based Value Iteration." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/izadi2005ijcai-using/)BibTeX
@inproceedings{izadi2005ijcai-using,
title = {{Using Core Beliefs for Point-Based Value Iteration}},
author = {Izadi, Masoumeh T. and Rajwade, Ajit V. and Precup, Doina},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2005},
pages = {1751-1753},
url = {https://mlanthology.org/ijcai/2005/izadi2005ijcai-using/}
}