Combining Code Generating Large Language Models and Self-Play to Iteratively Refine Strategies in Games

Bachrach, Yoram; Toledo, Edan; Hambardzumyan, Karen; Magka, Despoina; Josifoski, Martin; Jiang, Minqi; Foerster, Jakob N.; Raileanu, Roberta; Shavrina, Tatiana; Cancedda, Nicola; Ruderman, Avraham; Millican, Katie; Lupu, Andrei; Hazra, Rishi

doi:10.24963/IJCAI.2025/1249

IJCAI 2025 pp. 10999-11003

Abstract

We propose a self-play approach to generating strategies for multi-player games, where strategies are represented as computer code. We use large language models (LLMs) to generate pieces of code that play the game, which we refer to as generated bots. We engage the LLM-generated bots in competitions designed to produce increasingly stronger strategies. We follow game-theoretic principles in organizing these tournaments, using a Policy Space Response Oracle (PSRO) approach. We start with an initial set of LLM-generated bots and proceed in rounds, each of which adds a new bot to the population by asking the LLM to produce code for playing against a bot representing the Nash equilibrium mixture over the current population. Our analysis shows that even a few rounds are sufficient to produce strong bots for playing the game. Our demo shows the process for the game of Checkers. We allow users to select the initial bots in the population, run the process, inspect how the bots evolve over time, and play against the generated bots.
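
The round structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the game is a toy rock-paper-scissors game rather than Checkers, each "bot" is just an integer move rather than generated code, the hypothetical `oracle_best_response` function stands in for the LLM call by brute-force search, and the Nash mixture over the population is approximated with fictitious play. All function names are invented for illustration.

```python
# Toy PSRO-style loop. Assumptions (not from the paper): rock-paper-scissors
# instead of Checkers, integer moves instead of code bots, brute-force search
# instead of an LLM, fictitious play to approximate the Nash mixture.

MOVES = range(3)  # 0 = rock, 1 = paper, 2 = scissors


def payoff(a, b):
    """Payoff to move a against move b: +1 win, -1 loss, 0 draw."""
    diff = (a - b) % 3
    return 1 if diff == 1 else (-1 if diff == 2 else 0)


def approx_nash(matrix, iters=2000):
    """Approximate a symmetric zero-sum Nash mixture by fictitious play."""
    n = len(matrix)
    counts = [0] * n
    counts[0] = 1
    for _ in range(iters):
        # Best response to the empirical mixture played so far.
        values = [sum(matrix[i][j] * counts[j] for j in range(n)) for i in range(n)]
        best = max(range(n), key=values.__getitem__)
        counts[best] += 1
    total = sum(counts)
    return [c / total for c in counts]


def oracle_best_response(population, mixture):
    """Hypothetical stand-in for the LLM: best move against the mixture bot."""
    def expected(move):
        return sum(p * payoff(move, bot) for bot, p in zip(population, mixture))
    return max(MOVES, key=expected)


def run_psro(initial, rounds):
    population = list(initial)
    mixture = [1.0 / len(population)] * len(population)
    for _ in range(rounds):
        # Meta-game: payoffs between every pair of bots in the population.
        matrix = [[payoff(a, b) for b in population] for a in population]
        mixture = approx_nash(matrix)
        new_bot = oracle_best_response(population, mixture)
        if new_bot in population:
            break  # the oracle found nothing new; stop early
        population.append(new_bot)
    return population, mixture


if __name__ == "__main__":
    bots, mix = run_psro(initial=[0], rounds=5)
    print("population:", bots)
    print("mixture over population:", [round(p, 3) for p in mix])
```

Starting from a population containing only rock, each round adds the move that beats the current equilibrium mixture, and the loop stops once the oracle proposes a bot that is already present. In the setting the abstract describes, the oracle step would instead prompt an LLM for new code and the payoffs would come from playing Checkers matches between bots.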

Cite

Text

Bachrach et al. "Combining Code Generating Large Language Models and Self-Play to Iteratively Refine Strategies in Games." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1249

Markdown

[Bachrach et al. "Combining Code Generating Large Language Models and Self-Play to Iteratively Refine Strategies in Games." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/bachrach2025ijcai-combining/) doi:10.24963/IJCAI.2025/1249

BibTeX

@inproceedings{bachrach2025ijcai-combining,
  title     = {{Combining Code Generating Large Language Models and Self-Play to Iteratively Refine Strategies in Games}},
  author    = {Bachrach, Yoram and Toledo, Edan and Hambardzumyan, Karen and Magka, Despoina and Josifoski, Martin and Jiang, Minqi and Foerster, Jakob N. and Raileanu, Roberta and Shavrina, Tatiana and Cancedda, Nicola and Ruderman, Avraham and Millican, Katie and Lupu, Andrei and Hazra, Rishi},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {10999-11003},
  doi       = {10.24963/IJCAI.2025/1249},
  url       = {https://mlanthology.org/ijcai/2025/bachrach2025ijcai-combining/}
}