Choosing Well Your Opponents: How to Guide the Synthesis of Programmatic Strategies

Abstract

This paper introduces Local Learner (2L), an algorithm for providing a set of reference strategies to guide the search for programmatic strategies in two-player zero-sum games. Previous learning algorithms, such as Iterated Best Response (IBR), Fictitious Play (FP), and Double-Oracle (DO), can be computationally expensive or miss important information for guiding search algorithms. 2L actively selects a set of reference strategies to improve the search signal. We empirically demonstrate the advantages of our approach while guiding a local search algorithm for synthesizing strategies in three games, including MicroRTS, a challenging real-time strategy game. Results show that 2L learns reference strategies that provide a stronger search signal than IBR, FP, and DO. We also simulate a tournament of MicroRTS, where a synthesizer using 2L outperformed the winners of the two latest MicroRTS competitions, which were programmatic strategies written by human programmers.

Cite

Text

Moraes et al. "Choosing Well Your Opponents: How to Guide the Synthesis of Programmatic Strategies." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/539

Markdown

[Moraes et al. "Choosing Well Your Opponents: How to Guide the Synthesis of Programmatic Strategies." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/moraes2023ijcai-choosing/) doi:10.24963/IJCAI.2023/539

BibTeX

@inproceedings{moraes2023ijcai-choosing,
  title     = {{Choosing Well Your Opponents: How to Guide the Synthesis of Programmatic Strategies}},
  author    = {Moraes, Rubens O. and Aleixo, David S. and Ferreira, Lucas N. and Lelis, Levi H. S.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {4847-4854},
  doi       = {10.24963/IJCAI.2023/539},
  url       = {https://mlanthology.org/ijcai/2023/moraes2023ijcai-choosing/}
}