GEM: A Gym for Generalist LLMs

Abstract

The training paradigm for large language models (LLMs) is moving from static datasets to experience-based learning, where agents acquire skills via interacting with complex environments. To facilitate this transition we introduce GEM (General Experience Maker), an open-source environment simulator designed for the age of LLMs. Analogous to OpenAI-Gym for traditional reinforcement learning (RL), GEM provides a standardized framework for the environment-agent interface, including asynchronous vectorized execution for high throughput, and flexible wrappers for easy extensibility. GEM also features a diverse suite of environments, robust integrated tools, and single-file example scripts demonstrating using GEM with five popular RL training frameworks. Along with this, we also provide a set of baselines across 24 environments using REINFORCE with Return Batch Normalization (ReBN), which---unlike GRPO---is compatible with the full RL setting of dense per-turn rewards and arbitrary discount factors. We further conduct apple-to-apple benchmarking of PPO, GRPO and REINFORCE in both single- and multi-turn settings using GEM to shed light on the algorithmic designs. GEM also functions as a convenient evaluation toolkit besides a training environment. We hope this framework can help accelerate future agentic LLM research.

Cite

Text

Liu et al. "GEM: A Gym for Generalist LLMs." International Conference on Learning Representations, 2026.

Markdown

[Liu et al. "GEM: A Gym for Generalist LLMs." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/liu2026iclr-gem/)

BibTeX

@inproceedings{liu2026iclr-gem,
  title     = {{GEM: A Gym for Generalist LLMs}},
  author    = {Liu, Zichen and Sims, Anya and Duan, Keyu and Chen, Changyu and Yu, Simon and Zhou, Xiangxin and Xu, Haotian and Xiong, Shaopan and Liu, Bo and Tan, Chenmien and Wang, Weixun and Zhu, Hao and Shi, Weiyan and Yang, Diyi and Shieh, Michael Qizhe and Teh, Yee Whye and Lee, Wee Sun and Lin, Min},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/liu2026iclr-gem/}
}