Reasoning Gym: Reasoning Environments for Reinforcement Learning with Verifiable Rewards
Abstract
We introduce Reasoning Gym, a library of reasoning environments for reinforcement learning with verifiable rewards (RLVR). It provides over 100 tasks spanning multiple domains including algebra, arithmetic, computation, cognition, geometry, graph theory, logic, and various common games. Its key innovation is the ability to generate virtually infinite training data with adjustable complexity, unlike most previous reasoning datasets, which are typically fixed. This procedural generation approach allows for continuous evaluation across varying difficulty levels and task configurations. Our experimental results demonstrate the efficacy of Reasoning Gym in both evaluating and reinforcement learning of reasoning models.
Cite
Text
Stojanovski et al. "Reasoning Gym: Reasoning Environments for Reinforcement Learning with Verifiable Rewards." Advances in Neural Information Processing Systems, 2025.Markdown
[Stojanovski et al. "Reasoning Gym: Reasoning Environments for Reinforcement Learning with Verifiable Rewards." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/stojanovski2025neurips-reasoning/)BibTeX
@inproceedings{stojanovski2025neurips-reasoning,
title = {{Reasoning Gym: Reasoning Environments for Reinforcement Learning with Verifiable Rewards}},
author = {Stojanovski, Zafir and Stanley, Oliver and Sharratt, Joe and Jones, Richard and Adefioye, Abdulhakeem and Kaddour, Jean and Köpf, Andreas},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/stojanovski2025neurips-reasoning/}
}