A Symbolic SAT-Based Algorithm for Almost-Sure Reachability with Small Strategies in POMDPs
Abstract
POMDPs are standard models for probabilistic planning problems, where an agent interacts with an uncertain environment. We study the problem of almost-sure reachability, where given a set of target states, the question is to decide whether there is a policy to ensure that the target set is reached with probability 1 (almost-surely). While in general the problem is EXPTIME-complete, in many practical cases policies with a small amount of memory suffice. Moreover, the existing solution to the problem is explicit, which first requires to construct explicitly an exponential reduction to a belief-support MDP. In this work, we first study the existence of observation-stationary strategies, which is NP-complete, and then small-memory strategies. We present a symbolic algorithm by an efficient encoding to SAT and using a SAT solver for the problem. We report experimental results demonstrating the scalability of our symbolic (SAT-based) approach.
Cite
Text
Chatterjee et al. "A Symbolic SAT-Based Algorithm for Almost-Sure Reachability with Small Strategies in POMDPs." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10422Markdown
[Chatterjee et al. "A Symbolic SAT-Based Algorithm for Almost-Sure Reachability with Small Strategies in POMDPs." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/chatterjee2016aaai-symbolic/) doi:10.1609/AAAI.V30I1.10422BibTeX
@inproceedings{chatterjee2016aaai-symbolic,
title = {{A Symbolic SAT-Based Algorithm for Almost-Sure Reachability with Small Strategies in POMDPs}},
author = {Chatterjee, Krishnendu and Chmelik, Martin and Davies, Jessica},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2016},
pages = {3225-3232},
doi = {10.1609/AAAI.V30I1.10422},
url = {https://mlanthology.org/aaai/2016/chatterjee2016aaai-symbolic/}
}