Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

Zhao, Harry; Alver, Safa; van Seijen, Harm; Laroche, Romain; Precup, Doina; Bengio, Yoshua

Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

Harry Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio

ICLR 2024

/iclr/2024/zhao2024iclr-consciousnessinspired/

Abstract

Inspired by human conscious planning, we propose Skipper, a model-based reinforcement learning framework utilizing spatio-temporal abstractions to generalize better in novel situations. It automatically decomposes the given task into smaller, more manageable subtasks, and thus enables sparse decision-making and focused computation on the relevant parts of the environment. The decomposition relies on the extraction of an abstracted proxy problem represented as a directed graph, in which vertices and edges are learned end-to-end from hindsight. Our theoretical analyses provide performance guarantees under appropriate assumptions and establish where our approach is expected to be helpful. Generalization-focused experiments validate Skipper’s significant advantage in zero-shot generalization, compared to some existing state-of-the-art hierarchical planning methods.

PDF ICLR Semantic Scholar

Cite

Text

Zhao et al. "Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning." International Conference on Learning Representations, 2024.

Markdown

[Zhao et al. "Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning." International Conference on Learning Representations, 2024.](https://mlanthology.org/iclr/2024/zhao2024iclr-consciousnessinspired/)

BibTeX

@inproceedings{zhao2024iclr-consciousnessinspired,
  title     = {{Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning}},
  author    = {Zhao, Harry and Alver, Safa and van Seijen, Harm and Laroche, Romain and Precup, Doina and Bengio, Yoshua},
  booktitle = {International Conference on Learning Representations},
  year      = {2024},
  url       = {https://mlanthology.org/iclr/2024/zhao2024iclr-consciousnessinspired/}
}