Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Abstract
Learning effective visual representations is crucial in open-world environments where agents encounter diverse and unstructured observations. This ability enables agents to extract meaningful information from raw sensory inputs, like pixels, which is essential for generalization across different tasks. However, evaluating representation learning separately from policy learning remains a challenge in most reinforcement learning (RL) benchmarks. To address this, we introduce the Sliding Puzzles Gym (SPGym), a benchmark that extends the classic 15-tile puzzle with variable grid sizes and observation spaces, including large real-world image datasets. SPGym allows scaling the representation learning challenge while keeping the latent environment dynamics and algorithmic problem fixed, providing a targeted assessment of agents' ability to form compositional and generalizable state representations. Our experiments with both model-free and model-based RL algorithms, with and without explicit representation learning components, show that as the representation challenge scales, SPGym effectively distinguishes agents based on their capabilities. Moreover, SPGym reaches difficulty levels where no tested algorithm consistently excels, highlighting key challenges and opportunities for advancing representation learning for decision-making research.
Cite
Text
de Oliveira et al. "Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning." NeurIPS 2024 Workshops: OWA, 2024.Markdown
[de Oliveira et al. "Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning." NeurIPS 2024 Workshops: OWA, 2024.](https://mlanthology.org/neuripsw/2024/deoliveira2024neuripsw-sliding/)BibTeX
@inproceedings{deoliveira2024neuripsw-sliding,
title = {{Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning}},
author = {de Oliveira, Bryan Lincoln Marques and Brandão, Bruno and da Luz, Murilo Lopes and Martins, Luana Guedes Barros and de Lima Soares, Telma Woerle and Melo, Luckeciano Carvalho},
booktitle = {NeurIPS 2024 Workshops: OWA},
year = {2024},
url = {https://mlanthology.org/neuripsw/2024/deoliveira2024neuripsw-sliding/}
}