Symbolic Task Inference in Deep Reinforcement Learning
Abstract
This paper proposes DeepSynth, a method for effective training of deep reinforcement learning agents when the reward is sparse or non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives. Our method employs a novel algorithm for synthesis of compact finite state automata to uncover this sequential structure automatically. We synthesise a human-interpretable automaton from trace data collected by exploring the environment. The state space of the environment is then enriched with the synthesised automaton, so that the generation of a control policy by deep reinforcement learning is guided by the discovered structure encoded in the automaton. The proposed approach is able to cope with both high-dimensional, low-level features and unknown sparse or non-Markovian rewards. We have evaluated DeepSynth’s performance in a set of experiments that includes the Atari game Montezuma’s Revenge, known to be challenging. Compared to approaches that rely solely on deep reinforcement learning, we obtain a reduction of two orders of magnitude in the iterations required for policy synthesis, and a significant improvement in scalability.
Cite
Text
Hasanbeig et al. "Symbolic Task Inference in Deep Reinforcement Learning." Journal of Artificial Intelligence Research, 2024. doi:10.1613/JAIR.1.14063Markdown
[Hasanbeig et al. "Symbolic Task Inference in Deep Reinforcement Learning." Journal of Artificial Intelligence Research, 2024.](https://mlanthology.org/jair/2024/hasanbeig2024jair-symbolic/) doi:10.1613/JAIR.1.14063BibTeX
@article{hasanbeig2024jair-symbolic,
title = {{Symbolic Task Inference in Deep Reinforcement Learning}},
author = {Hasanbeig, Hosein and Jeppu, Natasha Yogananda and Abate, Alessandro and Melham, Tom and Kroening, Daniel},
journal = {Journal of Artificial Intelligence Research},
year = {2024},
pages = {1099-1137},
doi = {10.1613/JAIR.1.14063},
volume = {80},
url = {https://mlanthology.org/jair/2024/hasanbeig2024jair-symbolic/}
}