Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling

Thomy Phan, Lenz Belzner, Marie Kiermeier, Markus Friedrich, Kyrill Schmid, Claudia Linnhoff-Popien

AAAI 2019 pp. 7941-7948

doi:10.1609/AAAI.V33I01.33017941 /aaai/2019/phan2019aaai-memory/

Abstract

State-of-the-art approaches to partially observable planning like POMCP are based on stochastic tree search. While these approaches are computationally efficient, they may still construct search trees of considerable size, which could limit the performance due to restricted memory resources. In this paper, we propose Partially Observable Stacked Thompson Sampling (POSTS), a memory bounded approach to openloop planning in large POMDPs, which optimizes a fixed size stack of Thompson Sampling bandits. We empirically evaluate POSTS in four large benchmark problems and compare its performance with different tree-based approaches. We show that POSTS achieves competitive performance compared to tree-based open-loop planning and offers a performancememory tradeoff, making it suitable for partially observable planning with highly restricted computational and memory resources.

PDF AAAI Semantic Scholar

Cite

Text

Phan et al. "Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33017941

Markdown

[Phan et al. "Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/phan2019aaai-memory/) doi:10.1609/AAAI.V33I01.33017941

BibTeX

@inproceedings{phan2019aaai-memory,
  title     = {{Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling}},
  author    = {Phan, Thomy and Belzner, Lenz and Kiermeier, Marie and Friedrich, Markus and Schmid, Kyrill and Linnhoff-Popien, Claudia},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {7941-7948},
  doi       = {10.1609/AAAI.V33I01.33017941},
  url       = {https://mlanthology.org/aaai/2019/phan2019aaai-memory/}
}