Automatic Data Augmentation for Generalization in Reinforcement Learning

Abstract

Deep reinforcement learning (RL) agents often fail to generalize beyond their training environments. To alleviate this problem, recent work has proposed the use of data augmentation. However, different tasks tend to benefit from different types of augmentations and selecting the right one typically requires expert knowledge. In this paper, we introduce three approaches for automatically finding an effective augmentation for any RL task. These are combined with two novel regularization terms for the policy and value function, required to make the use of data augmentation theoretically sound for actor-critic algorithms. Our method achieves a new state-of-the-art on the Procgen benchmark and outperforms popular RL algorithms on DeepMind Control tasks with distractors. In addition, our agent learns policies and representations which are more robust to changes in the environment that are irrelevant for solving the task, such as the background.

Cite

Text

Raileanu et al. "Automatic Data Augmentation for Generalization in Reinforcement Learning." Neural Information Processing Systems, 2021.

Markdown

[Raileanu et al. "Automatic Data Augmentation for Generalization in Reinforcement Learning." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/raileanu2021neurips-automatic/)

BibTeX

@inproceedings{raileanu2021neurips-automatic,
  title     = {{Automatic Data Augmentation for Generalization in Reinforcement Learning}},
  author    = {Raileanu, Roberta and Goldstein, Maxwell and Yarats, Denis and Kostrikov, Ilya and Fergus, Rob},
  booktitle = {Neural Information Processing Systems},
  year      = {2021},
  url       = {https://mlanthology.org/neurips/2021/raileanu2021neurips-automatic/}
}