Error-Bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs
Abstract
We address decentralized stochastic control problems represented as decentralized partially observable Markov decision processes (Dec-POMDPs). This formalism provides a general model for decision-making under uncertainty in cooperative, decentralized settings, but the worst-case complexity makes it difficult to solve optimally (NEXP-complete). Recent advances suggest recasting Dec-POMDPs into continuous-state and deterministic MDPs. In this form, however, states and actions are embedded into high-dimensional spaces, making accurate estimate of states and greedy selection of actions intractable for all but trivial-sized problems. The primary contribution of this paper is the first framework for error-monitoring during approximate estimation of states and selection of actions. Such a framework permits us to convert state-of-the-art exact methods into error-bounded algorithms, which results in a scalability increase as demonstrated by experiments over problems of unprecedented sizes.
Cite
Text
Dibangoye et al. "Error-Bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014. doi:10.1007/978-3-662-44848-9_22Markdown
[Dibangoye et al. "Error-Bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2014.](https://mlanthology.org/ecmlpkdd/2014/dibangoye2014ecmlpkdd-errorbounded/) doi:10.1007/978-3-662-44848-9_22BibTeX
@inproceedings{dibangoye2014ecmlpkdd-errorbounded,
title = {{Error-Bounded Approximations for Infinite-Horizon Discounted Decentralized POMDPs}},
author = {Dibangoye, Jilles Steeve and Buffet, Olivier and Charpillet, François},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2014},
pages = {338-353},
doi = {10.1007/978-3-662-44848-9_22},
url = {https://mlanthology.org/ecmlpkdd/2014/dibangoye2014ecmlpkdd-errorbounded/}
}