Approximate Linear Programming for First-Order MDPs

UAI 2005 pp. 509-517

/uai/2005/sanner2005uai-approximate/

Abstract

We introduce a new approximate solution technique for first-order Markov decision processes (FOMDPs). Representing the value function linearly w.r.t. a set of first-order basis functions, we compute suitable weights by casting the corresponding optimization as a first-order linear program and show how off-the-shelf theorem prover and LP software can be effectively used. This technique allows one to solve FOMDPs independent of a specific domain instantiation; furthermore, it allows one to determine bounds on approximation error that apply equally to all domain instantiations. We apply this solution technique to the task of elevator scheduling with a rich feature space and multi-criteria additive reward, and demonstrate that it outperforms a number of intuitive, heuristicallyguided policies.

UAI Semantic Scholar

Cite

Text

Sanner and Boutilier. "Approximate Linear Programming for First-Order MDPs." Conference on Uncertainty in Artificial Intelligence, 2005.

Markdown

[Sanner and Boutilier. "Approximate Linear Programming for First-Order MDPs." Conference on Uncertainty in Artificial Intelligence, 2005.](https://mlanthology.org/uai/2005/sanner2005uai-approximate/)

BibTeX

@inproceedings{sanner2005uai-approximate,
  title     = {{Approximate Linear Programming for First-Order MDPs}},
  author    = {Sanner, Scott and Boutilier, Craig},
  booktitle = {Conference on Uncertainty in Artificial Intelligence},
  year      = {2005},
  pages     = {509-517},
  url       = {https://mlanthology.org/uai/2005/sanner2005uai-approximate/}
}