LLM GameLab: An Interactive Platform for Testing Large Language Models in Board Games

Abstract

While large language models are constantly evaluated in various skills, such as math, general knowledge, and coding, their ability to understand and follow game rules has not yet been deeply explored. The latter is especially important as it allows testing whether LLMs can operate within predefined limits without deviating or making illogical mistakes. Therefore, this demo paper presents a tool for interacting with LLMs in board games. The tool allows the creation of players with different large language models pitted against each other or to play in human vs. LLM mode. The platform includes rules predefined in prompts for four simple games based on Tic-Tac-Toe and Connect Four. Each player can be evaluated to account for their illegal movements, wins, draws, losses, and response times. The application also allows for the creation of new games, opening up the possibility of examining LLM behavior in situations they have not previously encountered.

Cite

Text

Morillo et al. "LLM GameLab: An Interactive Platform for Testing Large Language Models in Board Games." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06129-4_36

Markdown

[Morillo et al. "LLM GameLab: An Interactive Platform for Testing Large Language Models in Board Games." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/morillo2025ecmlpkdd-llm/) doi:10.1007/978-3-032-06129-4_36

BibTeX

@inproceedings{morillo2025ecmlpkdd-llm,
  title     = {{LLM GameLab: An Interactive Platform for Testing Large Language Models in Board Games}},
  author    = {Morillo, Paulina and Terreros, Alex and Ferri, Cèsar and Hernández-Orallo, José},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2025},
  pages     = {486-490},
  doi       = {10.1007/978-3-032-06129-4_36},
  url       = {https://mlanthology.org/ecmlpkdd/2025/morillo2025ecmlpkdd-llm/}
}