Decision Making as Language Generation
Abstract
Decision transformers are a recently proposed approach to offline reinforcement learning that leverages transformer-based auto-regressive sequence models. We discuss challenges associated with fine-tuning a given, pre-trained language model on a decision making task. We propose solutions to these challenges and study their viability on a shortest path problem. We also show how given language model allows us to bring to bear data-centric approaches to improving the model and how it opens up the possibility to treat the decision transformer objective as one task alongside others to perform transfer learning.
Cite
Text
Memisevic et al. "Decision Making as Language Generation." NeurIPS 2022 Workshops: FMDM, 2022.Markdown
[Memisevic et al. "Decision Making as Language Generation." NeurIPS 2022 Workshops: FMDM, 2022.](https://mlanthology.org/neuripsw/2022/memisevic2022neuripsw-decision/)BibTeX
@inproceedings{memisevic2022neuripsw-decision,
title = {{Decision Making as Language Generation}},
author = {Memisevic, Roland and Panchal, Sunny and Lee, Mingu},
booktitle = {NeurIPS 2022 Workshops: FMDM},
year = {2022},
url = {https://mlanthology.org/neuripsw/2022/memisevic2022neuripsw-decision/}
}