Towards Efficient Detection and Optimal Response Against Sophisticated Opponents
Abstract
Multiagent algorithms often aim to accurately predict the behaviors of other agents and find a best response accordingly. Previous works usually assume an opponent uses a stationary strategy or randomly switches among several stationary ones. However, an opponent may exhibit more sophisticated behaviors by adopting more advanced reasoning strategies, e.g., using a Bayesian reasoning strategy. This paper proposes a novel approach called Bayes-ToMoP which can efficiently detect the strategy of opponents using either stationary or higher-level reasoning strategies. Bayes-ToMoP also supports the detection of previously unseen policies and learning a best-response policy accordingly. We provide a theoretical guarantee of the optimality on detecting the opponent's strategies. We also propose a deep version of Bayes-ToMoP by extending Bayes-ToMoP with DRL techniques. Experimental results show both Bayes-ToMoP and deep Bayes-ToMoP outperform the state-of-the-art approaches when faced with different types of opponents in two-agent competitive games.
Cite
Text
Yang et al. "Towards Efficient Detection and Optimal Response Against Sophisticated Opponents." International Joint Conference on Artificial Intelligence, 2019. doi:10.24963/IJCAI.2019/88Markdown
[Yang et al. "Towards Efficient Detection and Optimal Response Against Sophisticated Opponents." International Joint Conference on Artificial Intelligence, 2019.](https://mlanthology.org/ijcai/2019/yang2019ijcai-efficient-a/) doi:10.24963/IJCAI.2019/88BibTeX
@inproceedings{yang2019ijcai-efficient-a,
title = {{Towards Efficient Detection and Optimal Response Against Sophisticated Opponents}},
author = {Yang, Tianpei and Hao, Jianye and Meng, Zhaopeng and Zhang, Chongjie and Zheng, Yan and Zheng, Ze},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2019},
pages = {623-629},
doi = {10.24963/IJCAI.2019/88},
url = {https://mlanthology.org/ijcai/2019/yang2019ijcai-efficient-a/}
}