PerfectDou: Dominating DouDizhu with Perfect Information Distillation

Abstract

As a challenging multi-player card game, DouDizhu has recently drawn much attention for analyzing competition and collaboration in imperfect-information games. In this paper, we propose PerfectDou, a state-of-the-art Doudizhu AI system that summits the game, in an actor-critic framework with a proposed technique named perfect information distillation.In detail, we adopt a perfect-training-imperfection-execution framework that allows the agents to utilize the global information to guide the training of the policies as if it is a perfect information game and the trained policies can be used to play the imperfect information game during the actual gameplay. Correspondingly, we characterize card and game features for DouDizhu to represent the perfect and imperfect information. To train our system, we adopt proximal policy optimization with generalized advantage estimation in a parallel training paradigm. In experiments we show how and why PerfectDou beats all existing programs, and achieves state-of-the-art performance.

Cite

Text

Yang et al. "PerfectDou: Dominating DouDizhu with Perfect Information Distillation." Neural Information Processing Systems, 2022.

Markdown

[Yang et al. "PerfectDou: Dominating DouDizhu with Perfect Information Distillation." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/yang2022neurips-perfectdou/)

BibTeX

@inproceedings{yang2022neurips-perfectdou,
  title     = {{PerfectDou: Dominating DouDizhu with Perfect Information Distillation}},
  author    = {Yang, Guan and Liu, Minghuan and Hong, Weijun and Zhang, Weinan and Fang, Fei and Zeng, Guangjun and Lin, Yue},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/yang2022neurips-perfectdou/}
}