XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Abstract
Inspired by the diversity and depth of XLand and the simplicity and minimalism of MiniGrid, we present XLand-MiniGrid, a suite of tools and grid-world environments for meta-reinforcement learning research. Written in JAX, XLand-MiniGrid is designed to be highly scalable and can potentially run on GPU or TPU accelerators, democratizing large-scale experimentation with limited resources. Along with the environments, XLand-MiniGrid provides pre-sampled benchmarks with millions of unique tasks of varying difficulty and easy-to-use baselines that allow users to quickly start training adaptive agents. In addition, we have conducted a preliminary analysis of scaling and generalization, showing that our baselines are capable of reaching millions of steps per second during training and validating that the proposed benchmarks are challenging. XLand-MiniGrid is open-source and available at \url{https://github.com/corl-team/xland-minigrid}.
Cite
Text
Nikulin et al. "XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX." Neural Information Processing Systems, 2024. doi:10.52202/079017-1390Markdown
[Nikulin et al. "XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/nikulin2024neurips-xlandminigrid/) doi:10.52202/079017-1390BibTeX
@inproceedings{nikulin2024neurips-xlandminigrid,
title = {{XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX}},
author = {Nikulin, Alexander and Kurenkov, Vladislav and Zisman, Ilya and Agarkov, Artem and Sinii, Viacheslav and Kolesnikov, Sergey},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-1390},
url = {https://mlanthology.org/neurips/2024/nikulin2024neurips-xlandminigrid/}
}