Symbolic Task Inference in Deep Reinforcement Learning

Abstract

This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability.

Cite

Text

Hasanbeig et al. "Symbolic Task Inference in Deep Reinforcement Learning." Journal of Artificial Intelligence Research, 2024. doi:10.1613/JAIR.1.14063

Markdown

[Hasanbeig et al. "Symbolic Task Inference in Deep Reinforcement Learning." Journal of Artificial Intelligence Research, 2024.](https://mlanthology.org/jair/2024/hasanbeig2024jair-symbolic/) doi:10.1613/JAIR.1.14063

BibTeX

@article{hasanbeig2024jair-symbolic,
  title     = {{Symbolic Task Inference in Deep Reinforcement Learning}},
  author    = {Hasanbeig, Hosein and Jeppu, Natasha Yogananda and Abate, Alessandro and Melham, Tom and Kroening, Daniel},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2024},
  pages     = {1099-1137},
  doi       = {10.1613/JAIR.1.14063},
  volume    = {80},
  url       = {https://mlanthology.org/jair/2024/hasanbeig2024jair-symbolic/}
}