On Reinforcement Learning for Full-Length Game of StarCraft

Abstract

StarCraft II poses a grand challenge for reinforcement learning. The main difficulties include huge state space, varying action space, long horizon, etc. In this paper, we investigate a set of techniques of reinforcement learning for the full-length game of StarCraft II. We investigate a hierarchical approach, where the hierarchy involves two levels of abstraction. One is the macro-actions extracted from expert’s demonstration trajectories, which can reduce the action space in an order of magnitude yet remain effective. The other is a two-layer hierarchical architecture, which is modular and easy to scale. We also investigate a curriculum transfer learning approach that trains the agent from the simplest opponent to harder ones. On a 64×64 map and using restrictive units, we train the agent on a single machine with 4 GPUs and 48 CPU threads. We achieve a winning rate of more than 99% against the difficulty level-1 built-in AI. Through the curriculum transfer learning algorithm and a mixture of combat model, we can achieve over 93% winning rate against the most difficult noncheating built-in AI (level-7) within days. We hope this study could shed some light on the future research of large-scale reinforcement learning.

Cite

Text

Pang et al. "On Reinforcement Learning for Full-Length Game of StarCraft." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33014691

Markdown

[Pang et al. "On Reinforcement Learning for Full-Length Game of StarCraft." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/pang2019aaai-reinforcement/) doi:10.1609/AAAI.V33I01.33014691

BibTeX

@inproceedings{pang2019aaai-reinforcement,
  title     = {{On Reinforcement Learning for Full-Length Game of StarCraft}},
  author    = {Pang, Zhen-Jia and Liu, Ruo-Ze and Meng, Zhou-Yu and Zhang, Yi and Yu, Yang and Lu, Tong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {4691-4698},
  doi       = {10.1609/AAAI.V33I01.33014691},
  url       = {https://mlanthology.org/aaai/2019/pang2019aaai-reinforcement/}
}