Value Iteration Working with Belief Subset

Zhang, Weihong; Zhang, Nevin Lianwen

doi:10.5555/777092.777142

AAAI 2002 pp. 307-313

Abstract

Value iteration is a popular algorithm for solving POMDPs, but it is inefficient in practice. The primary reason is that it must perform value updates for all belief states in the (continuous) belief space. In this paper, we study value iteration working with a subset of the belief space, i.e., value updates are performed only for belief states in the subset. We present a way to select a belief subset and describe an algorithm that conducts value iteration over the selected subset. The algorithm is attractive in that it works with a belief subset yet retains the quality of the generated values. Given a POMDP, we show how to determine a priori whether the selected subset is a proper subset of the belief space. If it is, the algorithm gains advantages in both space (representation) and time (efficiency).
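
To make the idea of updating values only on a belief subset concrete, the following is a minimal sketch under stated assumptions. It is not the paper's subset selection method or its algorithm: it uses a generic point-based backup over a small, hand-picked finite set of beliefs, with the value function stored as a set of vectors so that it can still be evaluated at any belief. The two-state POMDP, its numbers, and all function names (`project`, `backup`, `value_iteration_on_subset`, `value`) are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence

Vector = List[float]

# Hypothetical 2-state, 2-action, 2-observation POMDP (illustrative numbers only).
GAMMA = 0.9
T = [  # T[a][s][s2]: probability of moving from s to s2 under action a
    [[0.9, 0.1], [0.1, 0.9]],
    [[0.5, 0.5], [0.5, 0.5]],
]
O = [  # O[a][s2][z]: probability of observing z after reaching s2 via action a
    [[0.8, 0.2], [0.2, 0.8]],
    [[0.5, 0.5], [0.5, 0.5]],
]
R = [  # R[a][s]: immediate reward for taking action a in state s
    [1.0, 0.0],
    [0.0, 2.0],
]


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(u, v))


def project(alpha: Sequence[float], a: int, z: int) -> Vector:
    """Return g with g[s] = sum_s2 T[a][s][s2] * O[a][s2][z] * alpha[s2]."""
    n = len(alpha)
    return [
        sum(T[a][s][s2] * O[a][s2][z] * alpha[s2] for s2 in range(n))
        for s in range(n)
    ]


def backup(b: Sequence[float], vectors: List[Vector]) -> Vector:
    """One Bellman backup at belief b; returns the maximizing vector at b."""
    best = None
    for a in range(len(T)):
        alpha = list(R[a])
        for z in range(len(O[a][0])):
            # For each observation, keep the projected vector that is best at b.
            g = max((project(v, a, z) for v in vectors), key=lambda g: dot(b, g))
            alpha = [x + GAMMA * y for x, y in zip(alpha, g)]
        if best is None or dot(b, alpha) > dot(b, best):
            best = alpha
    return best


def value_iteration_on_subset(beliefs: List[Vector], sweeps: int) -> List[Vector]:
    """Repeat backups only at the given beliefs, starting from the zero function."""
    vectors: List[Vector] = [[0.0] * len(beliefs[0])]
    for _ in range(sweeps):
        vectors = [backup(b, vectors) for b in beliefs]
    return vectors


def value(b: Sequence[float], vectors: List[Vector]) -> float:
    """Evaluate the piecewise-linear value function at any belief b."""
    return max(dot(b, v) for v in vectors)


if __name__ == "__main__":
    subset = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # hand-picked belief subset
    vs = value_iteration_on_subset(subset, sweeps=50)
    for b in subset:
        print(b, value(b, vs))
```

Because the sketch keeps one vector per belief in the subset, the cost of each sweep depends on the size of the subset rather than on the whole belief space. How to choose a subset so that the resulting values keep their quality is the question the paper addresses; the hand-picked points above make no such guarantee.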

Cite

Text

Zhang and Zhang. "Value Iteration Working with Belief Subset." AAAI Conference on Artificial Intelligence, 2002. doi:10.5555/777092.777142

Markdown

[Zhang and Zhang. "Value Iteration Working with Belief Subset." AAAI Conference on Artificial Intelligence, 2002.](https://mlanthology.org/aaai/2002/zhang2002aaai-value/) doi:10.5555/777092.777142

BibTeX

@inproceedings{zhang2002aaai-value,
  title     = {{Value Iteration Working with Belief Subset}},
  author    = {Zhang, Weihong and Zhang, Nevin Lianwen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2002},
  pages     = {307-313},
  doi       = {10.5555/777092.777142},
  url       = {https://mlanthology.org/aaai/2002/zhang2002aaai-value/}
}