Composing Task Knowledge with Modular Successor Feature Approximators

Carvalho, Wilka Torrico; Filos, Angelos; Lewis, Richard; Lee, Honglak; Singh, Satinder

Composing Task Knowledge with Modular Successor Feature Approximators

Wilka Torrico Carvalho, Angelos Filos, Richard Lewis, Honglak Lee, Satinder Singh

ICLR 2023

/iclr/2023/carvalho2023iclr-composing/

Abstract

Recently, the Successor Features and Generalized Policy Improvement (SF&GPI) framework has been proposed as a method for learning, composing and transferring predictive knowledge and behavior. SF&GPI works by having an agent learn predictive representations (SFs) that can be combined for transfer to new tasks with GPI. However, to be effective this approach requires state features that are useful to predict, and these state-features are typically hand-designed. In this work, we present a novel neural network architecture, “Modular Successor Feature Approximators” (MSFA), where modules both discover what is useful to predict, and learn their own predictive representations. We show that MSFA is able to better generalize compared to baseline architectures for learning SFs and a modular network that discovers factored state representations.

PDF ICLR Semantic Scholar

Cite

Text

Carvalho et al. "Composing Task Knowledge with Modular Successor Feature Approximators." International Conference on Learning Representations, 2023.

Markdown

[Carvalho et al. "Composing Task Knowledge with Modular Successor Feature Approximators." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/carvalho2023iclr-composing/)

BibTeX

@inproceedings{carvalho2023iclr-composing,
  title     = {{Composing Task Knowledge with Modular Successor Feature Approximators}},
  author    = {Carvalho, Wilka Torrico and Filos, Angelos and Lewis, Richard and Lee, Honglak and Singh, Satinder},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/carvalho2023iclr-composing/}
}