SPUDD: Stochastic Planning Using Decision Diagrams

Abstract

Recently, structured methods for solving factored Markov decisions processes (MDPs) with large state spaces have been proposed recently to allow dynamic programming to be applied without the need for complete state enumeration. We propose and examine a new value iteration algorithm for MDPs that uses algebraic decision diagrams (ADDs) to represent value functions and policies, assuming an ADD input representation of the MDP. Dynamic programming is implemented via ADD manipulation. We demonstrate our method on a class of large MDPs (up to 63 million states) and show that significant gains can be had when compared to tree-structured representations (with up to a thirty-fold reduction in the number of nodes required to represent optimal value functions).

Cite

Text

Hoey et al. "SPUDD: Stochastic Planning Using Decision Diagrams." Conference on Uncertainty in Artificial Intelligence, 1999.

Markdown

[Hoey et al. "SPUDD: Stochastic Planning Using Decision Diagrams." Conference on Uncertainty in Artificial Intelligence, 1999.](https://mlanthology.org/uai/1999/hoey1999uai-spudd/)

BibTeX

@inproceedings{hoey1999uai-spudd,
  title     = {{SPUDD: Stochastic Planning Using Decision Diagrams}},
  author    = {Hoey, Jesse and St-Aubin, Robert and Hu, Alan J. and Boutilier, Craig},
  booktitle = {Conference on Uncertainty in Artificial Intelligence},
  year      = {1999},
  pages     = {279-288},
  url       = {https://mlanthology.org/uai/1999/hoey1999uai-spudd/}
}