Model Minimization in Markov Decision Processes

Abstract

We use the notion of stochastic bisimulation homogeneity to analyze planning problems represented as Markov decision processes (MDPs). Informally, a partition of the state space for an MDP is said to be homogeneous if for each action, states in the same block have the same probability of being carried to each other block. We provide an algorithm for finding the coarsest homogeneous refinement of any partition of the state space of an MDP. The resulting partition can be used to construct a reduced MDP which is minimal in a well defined sense and can be used to solve the original MDP. Our algorithm is an adaptation of known automata minimization algorithms, and is designed to operate naturally on factored or implicit representations in which the full state space is never explicitly enumerated. We show that simple variations on this algorithm are equivalent or closely similar to several different recently published algorithms for finding optimal solutions to (partially ...

Cite

Text

Dean and Givan. "Model Minimization in Markov Decision Processes." AAAI Conference on Artificial Intelligence, 1997.

Markdown

[Dean and Givan. "Model Minimization in Markov Decision Processes." AAAI Conference on Artificial Intelligence, 1997.](https://mlanthology.org/aaai/1997/dean1997aaai-model/)

BibTeX

@inproceedings{dean1997aaai-model,
  title     = {{Model Minimization in Markov Decision Processes}},
  author    = {Dean, Thomas L. and Givan, Robert},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {1997},
  pages     = {106-111},
  url       = {https://mlanthology.org/aaai/1997/dean1997aaai-model/}
}