ProgPrompt: Generating Situated Robot Task Plans Using Large Language Models

Abstract

Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with executable example programs. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks.

Cite

Text

Singh et al. "ProgPrompt: Generating Situated Robot Task Plans Using Large Language Models." NeurIPS 2022 Workshops: LaReL, 2022.

Markdown

[Singh et al. "ProgPrompt: Generating Situated Robot Task Plans Using Large Language Models." NeurIPS 2022 Workshops: LaReL, 2022.](https://mlanthology.org/neuripsw/2022/singh2022neuripsw-progprompt/)

BibTeX

@inproceedings{singh2022neuripsw-progprompt,
  title     = {{ProgPrompt: Generating Situated Robot Task Plans Using Large Language Models}},
  author    = {Singh, Ishika and Blukis, Valts and Mousavian, Arsalan and Goyal, Ankit and Xu, Danfei and Tremblay, Jonathan and Fox, Dieter and Thomason, Jesse and Garg, Animesh},
  booktitle = {NeurIPS 2022 Workshops: LaReL},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/singh2022neuripsw-progprompt/}
}