Solving Very Large Weakly Coupled Markov Decision Processes

Meuleau, Nicolas; Hauskrecht, Milos; Kim, Kee-Eung; Peshkin, Leonid; Kaelbling, Leslie Pack; Dean, Thomas L.; Boutilier, Craig

AAAI 1998 pp. 165-172

Abstract

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key properties to avoid explicitly enumerating the very large state and action spaces associated with these problems. First, the problems are composed of multiple tasks whose utilities are independent. Second, the actions taken with respect to (or resources allocated to) a task do not influence the status of any other task. We can therefore view each task as an MDP. However, these MDPs are weakly coupled by resource constraints: actions selected for one MDP restrict the actions available to others. We describe heuristic techniques for dealing with several classes of constraints that use the solutions for individual MDPs to construct an approximate global solution. We demonstrate this technique on problems involving thousands of tasks, approximating the solution to problems that are far beyond the reach of standard methods.
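The general idea in the abstract (solve each task's MDP on its own, then combine the per-task solutions under a shared resource constraint) can be illustrated with a minimal sketch. This is not the algorithm from the paper. The two-state task model, the per-step budget constraint, the greedy marginal-gain rule, and the names `solve_task` and `allocate` are all assumptions made for illustration.

```python
from typing import List, Tuple


def solve_task(p: float, reward: float, gamma: float = 0.95,
               iters: int = 500) -> Tuple[float, float]:
    """Value iteration for one hypothetical two-state task (pending/done).

    Acting on a pending task completes it with probability p and pays
    `reward`; the done state is absorbing with value 0.
    Returns (Q(pending, idle), Q(pending, act)).
    """
    v = 0.0  # value of the pending state
    for _ in range(iters):
        q_idle = gamma * v
        q_act = p * reward + (1.0 - p) * gamma * v
        v = max(q_idle, q_act)
    return gamma * v, p * reward + (1.0 - p) * gamma * v


def allocate(tasks: List[Tuple[float, float]], pending: List[bool],
             budget: int) -> List[int]:
    """Greedy heuristic (an assumption, not the paper's method): spend at
    most `budget` resource units this step on the pending tasks whose
    individual solutions show the largest gain from acting over idling."""
    gains = []
    for i, (p, reward) in enumerate(tasks):
        if pending[i]:
            q_idle, q_act = solve_task(p, reward)
            gains.append((q_act - q_idle, i))
    gains.sort(reverse=True)
    return sorted(i for gain, i in gains[:budget] if gain > 0)


# Three illustrative tasks given as (success probability, reward).
tasks = [(0.9, 10.0), (0.5, 10.0), (0.1, 100.0)]
chosen = allocate(tasks, pending=[True, True, True], budget=2)
```

Each task is solved in isolation, so the cost grows linearly with the number of tasks instead of with the size of the joint state and action space. The coupling enters only in `allocate`, where the budget limits which per-task actions can be taken together.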

Cite

Text

Meuleau et al. "Solving Very Large Weakly Coupled Markov Decision Processes." AAAI Conference on Artificial Intelligence, 1998.

Markdown

[Meuleau et al. "Solving Very Large Weakly Coupled Markov Decision Processes." AAAI Conference on Artificial Intelligence, 1998.](https://mlanthology.org/aaai/1998/meuleau1998aaai-solving/)

BibTeX

@inproceedings{meuleau1998aaai-solving,
  title     = {{Solving Very Large Weakly Coupled Markov Decision Processes}},
  author    = {Meuleau, Nicolas and Hauskrecht, Milos and Kim, Kee-Eung and Peshkin, Leonid and Kaelbling, Leslie Pack and Dean, Thomas L. and Boutilier, Craig},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {1998},
  pages     = {165-172},
  url       = {https://mlanthology.org/aaai/1998/meuleau1998aaai-solving/}
}