Solving Factored MDPs via Non-Homogeneous Partitioning
Abstract
This paper describes an algorithm for solving large state-space MDPs (represented as factored MDPs) using search by successive refinement in the space of non-homogeneous partitions. Homogeneity is defined in terms of bisimulation and reward equivalence within blocks of a partition. Since homogeneous partitions that define equivalent reduced state-space MDPs can have a large number of blocks, we relax the requirement of homogeneity. The algorithm constructs approximate aggregate MDPs from non-homogeneous partitions, solves the aggregate MDPs exactly, and then uses the resulting value functions as part of a heuristic in refining the current best non-homogeneous partition. We outline the theory motivating the use of this heuristic and present empirical results and comparisons.
Cite
Text
Kim and Dean. "Solving Factored MDPs via Non-Homogeneous Partitioning." International Joint Conference on Artificial Intelligence, 2001.Markdown
[Kim and Dean. "Solving Factored MDPs via Non-Homogeneous Partitioning." International Joint Conference on Artificial Intelligence, 2001.](https://mlanthology.org/ijcai/2001/kim2001ijcai-solving/)BibTeX
@inproceedings{kim2001ijcai-solving,
title = {{Solving Factored MDPs via Non-Homogeneous Partitioning}},
author = {Kim, Kee-Eung and Dean, Thomas L.},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2001},
pages = {683-689},
url = {https://mlanthology.org/ijcai/2001/kim2001ijcai-solving/}
}