Jumanji: A Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Abstract

Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments specifically designed to be fast, flexible, and scalable. Jumanji provides a suite of environments focusing on combinatorial problems frequently encountered in industry, as well as challenging general decision-making tasks. By leveraging the efficiency of JAX and hardware accelerators like GPUs and TPUs, Jumanji enables rapid iteration of research ideas and large-scale experimentation, ultimately empowering more capable agents. Unlike existing RL environment suites, Jumanji is highly customizable, allowing users to tailor the initial state distribution and problem complexity to their needs. Furthermore, we provide actor-critic baselines for each environment, accompanied by preliminary findings on scaling and generalization scenarios. Jumanji aims to set a new standard for speed, adaptability, and scalability of RL environments.

Cite

Text

Bonnet et al. "Jumanji: A Diverse Suite of Scalable Reinforcement Learning Environments in JAX." International Conference on Learning Representations, 2024.

Markdown

[Bonnet et al. "Jumanji: A Diverse Suite of Scalable Reinforcement Learning Environments in JAX." International Conference on Learning Representations, 2024.](https://mlanthology.org/iclr/2024/bonnet2024iclr-jumanji/)

BibTeX

@inproceedings{bonnet2024iclr-jumanji,
  title     = {{Jumanji: A Diverse Suite of Scalable Reinforcement Learning Environments in JAX}},
  author    = {Bonnet, Clément and Luo, Daniel and Byrne, Donal John and Surana, Shikha and Abramowitz, Sasha and Duckworth, Paul and Coyette, Vincent and Midgley, Laurence Illing and Tegegn, Elshadai and Kalloniatis, Tristan and Mahjoub, Omayma and Macfarlane, Matthew and Smit, Andries Petrus and Grinsztajn, Nathan and Boige, Raphael and Waters, Cemlyn Neil and Mimouni, Mohamed Ali Ali and Sob, Ulrich Armel Mbou and de Kock, Ruan John and Singh, Siddarth and Furelos-Blanco, Daniel and Le, Victor and Pretorius, Arnu and Laterre, Alexandre},
  booktitle = {International Conference on Learning Representations},
  year      = {2024},
  url       = {https://mlanthology.org/iclr/2024/bonnet2024iclr-jumanji/}
}