Piecewise Linear Dynamic Programming for Constrained POMDPs

Abstract

We describe an exact dynamic programming update for constrained partially observable Markov decision processes (CPOMDPs). State-of-the-art exact solution of unconstrained POMDPs relies on implicit enumeration of the vectors in the piecewise linear value function, and pruning operations to ob-tain a minimal representation of the updated value function. In dynamic programming for CPOMDPs, each vector takes two valuations, one with respect to the objective function and another with respect to the constraint function. The dynamic programming update consists of finding, for each belief state, the vector that has the best objective function valuation while still satisfying the constraint function. Whereas the pruning operation in an unconstrained POMDP requires solution of a linear program, the pruning operation for CPOMDPs requires

Cite

Text

Isom et al. "Piecewise Linear Dynamic Programming for Constrained POMDPs." AAAI Conference on Artificial Intelligence, 2008.

Markdown

[Isom et al. "Piecewise Linear Dynamic Programming for Constrained POMDPs." AAAI Conference on Artificial Intelligence, 2008.](https://mlanthology.org/aaai/2008/isom2008aaai-piecewise/)

BibTeX

@inproceedings{isom2008aaai-piecewise,
  title     = {{Piecewise Linear Dynamic Programming for Constrained POMDPs}},
  author    = {Isom, Joshua D. and Meyn, Sean P. and Braatz, Richard D.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2008},
  pages     = {291-296},
  url       = {https://mlanthology.org/aaai/2008/isom2008aaai-piecewise/}
}