Code Driven Planning with Domain-Adaptive Selector

Abstract

Large Language Models (LLMs) have been widely adopted as task planners for AI agents in sequential decision-making problems, leveraging their extensive world knowledge. However, the gap between their general knowledge and environment-specific requirements often leads to inaccurate plans. To address this, existing approaches rely on frequent LLM queries to iteratively refine plans based on immediate environmental feedback, which incurs substantial query costs. However, this refinement is typically guided by short-term environmental feedback, limiting LLMs from developing plans aligned with long-term rewards. We propose **Co**de Driven **P**lanning w**i**th Domain-Adaptive Sele**C**tor (CoPiC). Instead of relying on frequent queries, CoPiC employs LLMs to generate a diverse set of high-level planning programs, which iteratively produce and refine candidate plans. A trained domain-adaptive selector then evaluates these candidates and selects the one most aligned with long-term rewards for execution. Using high-level planning programs as planner and domain-adaptive selector as estimator, CoPiC improves planning while significantly reducing query costs. Results in ALFWorld, NetHack, and StarCraft II Unit Building show that CoPiC outperforms advanced LLM-based baselines, achieving an average (1) 19.14\% improvement in success rate and (2) 79.39\% reduction in token costs.

Cite

Text

Tian et al. "Code Driven Planning with Domain-Adaptive Selector." International Conference on Learning Representations, 2026.

Markdown

[Tian et al. "Code Driven Planning with Domain-Adaptive Selector." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/tian2026iclr-code/)

BibTeX

@inproceedings{tian2026iclr-code,
  title     = {{Code Driven Planning with Domain-Adaptive Selector}},
  author    = {Tian, Zikang and Peng, Shaohui and Huang, Di and Guo, Jiaming and Chen, Ruizhi and Zhang, Rui and Zhang, Xishan and Guo, Yuxuan and Du, Zidong and Guo, Qi and Li, Ling and Pu, Yewen and Hu, Xing and Chen, Yunji},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/tian2026iclr-code/}
}