Natural Language PDDL (NL-PDDL) for Open-World Goal-Oriented Commonsense Regression Planning in Embodied AI

Abstract

Planning in open-world environments, where agents must act with partially observed states and incomplete knowledge, is a central challenge in embodied AI. Open-world planning involves not only sequencing actions but also determining what information the agent needs to sense to enable those actions. Existing approaches using Large Language Models (LLM) and Vision-Language Models (VLM) cannot reliably plan over long horizons and complex goals, where they often hallucinate and fail to reason causally over agent-environment interactions. Alternatively, classical PDDL planners offer correct and principled reasoning, but fail in open-world settings: they presuppose complete models and depend on exhaustive grounding over all objects, states, and actions; they cannot address misalignment between goal specifications (e.g., “heat the bread”) and action specifications (e.g., “toast the bread”); and they do not generalize across modalities (e.g., text, vision). To address these core challenges: (i) we extend symbolic PDDL into a flexible natural language representation that we term NL-PDDL, improving accessibility for non-expert users as well as generalization over modalities; (ii) we generalize regression-style planning to NL-PDDL with commonsense entailment reasoning to determine what needs to be observed for goal achievement in partially-observed environments with potential goal–action specification misalignment; and (iii) we leverage the lifted specification of NL-PDDL to facilitate open-world planning that avoids exhaustive grounding and yields a time and space complexity independent of the number of ground objects, states, and actions. Our experiments in three diverse domains — classical Blocksworld and the embodied ALFWorld environment with both textual and visual states — show that NL-PDDL substantially outperforms existing baselines, is more robust to longer horizons and more complex goals, and generalizes across modalities.

Cite

Text

Liu et al. "Natural Language PDDL (NL-PDDL) for Open-World Goal-Oriented Commonsense Regression Planning in Embodied AI." International Conference on Learning Representations, 2026.

Markdown

[Liu et al. "Natural Language PDDL (NL-PDDL) for Open-World Goal-Oriented Commonsense Regression Planning in Embodied AI." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/liu2026iclr-natural/)

BibTeX

@inproceedings{liu2026iclr-natural,
  title     = {{Natural Language PDDL (NL-PDDL) for Open-World Goal-Oriented Commonsense Regression Planning in Embodied AI}},
  author    = {Liu, Xiaotian and Toroghi, Armin and Liang, Jiazhou and Courtis, David and Li, Ruiwen and Pesaranghader, Ali and Kim, Jaehong and Sadhu, Tanmana and Jeon, Hyejeong and Sanner, Scott},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/liu2026iclr-natural/}
}