MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs
Abstract
We present multi-agent A* (MAA*), the first complete and optimal heuristic search algorithm for solving decentralized partially-observable Markov decision problems (DECPOMDPs) with finite horizon. The algorithm is suitable for computing optimal plans for a cooperative group of agents that operate in a stochastic environment such as multi-robot coordination, network traffic control, or distributed resource allocation. Solving such problems effectively is a major challenge in the area of planning under uncertainty. Our solution is based on a synthesis of classical heuristic search and decentralized control theory. Experimental results show that MAA* has significant advantages. We introduce an anytime variant of MAA* and conclude with a discussion of promising extensions such as an approach to solving infinite horizon problems.
Cite
Text
Szer et al. "MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs." Conference on Uncertainty in Artificial Intelligence, 2005.Markdown
[Szer et al. "MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs." Conference on Uncertainty in Artificial Intelligence, 2005.](https://mlanthology.org/uai/2005/szer2005uai-maa/)BibTeX
@inproceedings{szer2005uai-maa,
title = {{MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs}},
author = {Szer, Daniel and Charpillet, François and Zilberstein, Shlomo},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {2005},
pages = {576-590},
url = {https://mlanthology.org/uai/2005/szer2005uai-maa/}
}