Privacy-Preserving Policy Iteration for Decentralized POMDPs
Abstract
We propose the first privacy-preserving approach to address the privacy issues that arise in multi-agent planning problems modeled as a Dec-POMDP. Our solution is a distributed message-passing algorithm based on trials, where the agents' policies are optimized using the cross-entropy method. In our algorithm, the agents' private information is protected using a public-key homomorphic cryptosystem. We prove the correctness of our algorithm and analyze its complexity in terms of message passing and encryption/decryption operations. Furthermore, we analyze several privacy aspects of our algorithm and show that it can preserve the agent privacy of non-neighbors, model privacy, and decision privacy. Our experimental results on several common Dec-POMDP benchmark problems confirm the effectiveness of our approach.
Cite
Text
Wu et al. "Privacy-Preserving Policy Iteration for Decentralized POMDPs." AAAI Conference on Artificial Intelligence, 2018. doi:10.1609/AAAI.V32I1.11584Markdown
[Wu et al. "Privacy-Preserving Policy Iteration for Decentralized POMDPs." AAAI Conference on Artificial Intelligence, 2018.](https://mlanthology.org/aaai/2018/wu2018aaai-privacy/) doi:10.1609/AAAI.V32I1.11584BibTeX
@inproceedings{wu2018aaai-privacy,
title = {{Privacy-Preserving Policy Iteration for Decentralized POMDPs}},
author = {Wu, Feng and Zilberstein, Shlomo and Chen, Xiaoping},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2018},
pages = {4759-4766},
doi = {10.1609/AAAI.V32I1.11584},
url = {https://mlanthology.org/aaai/2018/wu2018aaai-privacy/}
}