Using Core Beliefs for Point-Based Value Iteration

Izadi, Masoumeh T.; Rajwade, Ajit V.; Precup, Doina

Using Core Beliefs for Point-Based Value Iteration

Masoumeh T. Izadi, Ajit V. Rajwade, Doina Precup

IJCAI 2005 pp. 1751-1753

/ijcai/2005/izadi2005ijcai-using/

Abstract

Recent research on point-based approximation algorithms for POMDPs demonstrated that good solutions to POMDP problems can be obtained without considering the entire belief simplex. For instance, the Point Based Value Iteration (PBVI) algorithm [Pineau et al., 2003] computes the value function only for a small set of belief states and iteratively adds more points to the set as needed. A key component of the algorithm is the strategy for selecting belief points, such that the space of reachable beliefs is well covered. This paper presents a new method for selecting an initial set of representative belief points, which relies on finding first the basis for the reachable belief simplex. Our approach has better worst-case performance than the original PBVI heuristic, and performs well in several standard POMDP tasks.

PDF Semantic Scholar

Cite

Text

Izadi et al. "Using Core Beliefs for Point-Based Value Iteration." International Joint Conference on Artificial Intelligence, 2005.

Markdown

[Izadi et al. "Using Core Beliefs for Point-Based Value Iteration." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/izadi2005ijcai-using/)

BibTeX

@inproceedings{izadi2005ijcai-using,
  title     = {{Using Core Beliefs for Point-Based Value Iteration}},
  author    = {Izadi, Masoumeh T. and Rajwade, Ajit V. and Precup, Doina},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {1751-1753},
  url       = {https://mlanthology.org/ijcai/2005/izadi2005ijcai-using/}
}