Solving Concurrent Markov Decision Processes
Abstract
Typically, Markov decision problems (MDPs) assume a single action is executed per decision epoch, but in the real world one may frequently exe-cute certain actions in parallel. This paper explores concurrent MDPs, MDPs which allow multiple non-conflicting actions to be executed simultaneously, and presents two new algorithms. Our first approach exploits two prov-ably sound pruning rules, and thus guarantees solution optimality. Our sec-ond technique is a fast, sampling-based algorithm, which produces close-to-optimal solutions extremely quickly. Experiments show that our approaches outperform the existing algorithms producing up to two orders of magnitude speedup. 1
Cite
Text
Mausam and Weld. "Solving Concurrent Markov Decision Processes." AAAI Conference on Artificial Intelligence, 2004.Markdown
[Mausam and Weld. "Solving Concurrent Markov Decision Processes." AAAI Conference on Artificial Intelligence, 2004.](https://mlanthology.org/aaai/2004/mausam2004aaai-solving/)BibTeX
@inproceedings{mausam2004aaai-solving,
title = {{Solving Concurrent Markov Decision Processes}},
author = {Mausam, and Weld, Daniel S.},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2004},
pages = {716-722},
url = {https://mlanthology.org/aaai/2004/mausam2004aaai-solving/}
}