Knot so Simple: A Minimalistic Environment for Spatial Reasoning

Abstract

We propose KnotGym, an interactive environment for complex spatial reasoning and manipulation. KnotGym includes goal-oriented rope manipulation tasks with varying levels of complexity, all requiring acting from pure image observations. Tasks are defined along a clear and quantifiable axis of complexity based on the number of knot crossings, creating a natural generalization test. KnotGym has a simple observation space, allowing for scalable development, yet it highlights core challenges in integrating acute perception, spatial reasoning, and grounded manipulation. We evaluate methods of different classes, including model-based RL, model-predictive control, and chain-of-thought reasoning, and illustrate the challenges KnotGym presents.
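
To make the crossing-based complexity axis concrete, here is a minimal sketch of counting self-crossings in a rope drawn as a 2D polyline. It is an illustration only, not KnotGym's implementation or API: the function names and the polyline representation are assumptions, the count is for one planar projection rather than a knot-theoretic crossing number, and over/under information is ignored.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def _orient(a: Point, b: Point, c: Point) -> float:
    """Signed area test: >0 if c is left of a->b, <0 if right, 0 if collinear."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def _segments_cross(p1: Point, p2: Point, q1: Point, q2: Point) -> bool:
    """True only for a proper crossing (interiors intersect; touching ends do not count)."""
    d1 = _orient(q1, q2, p1)
    d2 = _orient(q1, q2, p2)
    d3 = _orient(p1, p2, q1)
    d4 = _orient(p1, p2, q2)
    return d1 * d2 < 0 and d3 * d4 < 0


def count_crossings(points: List[Point]) -> int:
    """Count proper self-crossings of the polyline through `points` (hypothetical helper)."""
    n_segments = len(points) - 1
    count = 0
    for i in range(n_segments):
        # Skip the adjacent segment (i + 1): it shares an endpoint and cannot properly cross.
        for j in range(i + 2, n_segments):
            if _segments_cross(points[i], points[i + 1], points[j], points[j + 1]):
                count += 1
    return count


if __name__ == "__main__":
    straight = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
    single_loop = [(0.0, 0.0), (2.0, 2.0), (2.0, 0.0), (0.0, 2.0)]
    print(count_crossings(straight), count_crossings(single_loop))
```

Under this simplified measure, a task with more crossings in its goal configuration would sit further along the complexity axis than one with fewer.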

Cite

Text

Chen and Artzi. "Knot so Simple: A Minimalistic Environment for Spatial Reasoning." Advances in Neural Information Processing Systems, 2025.

Markdown

[Chen and Artzi. "Knot so Simple: A Minimalistic Environment for Spatial Reasoning." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/chen2025neurips-knot/)

BibTeX

@inproceedings{chen2025neurips-knot,
  title     = {{Knot so Simple: A Minimalistic Environment for Spatial Reasoning}},
  author    = {Chen, Zizhao and Artzi, Yoav},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/chen2025neurips-knot/}
}