TPTU-V2: Boosting Task Planning and Tool Usage of Large Language Model-Based Agents in Real-World Systems

Yilun Kong, Jingqing Ruan, YiHong Chen, Bin Zhang, Tianpeng Bao, Shi Shiwei, Du Guo Qing, Xiaoru Hu, Hangyu Mao, Ziyue Li, Xingyu Zeng, Rui Zhao, Xueqian Wang

ICLRW 2024

/iclrw/2024/kong2024iclrw-tptuv2/

Abstract

Large Language Models (LLMs) have demonstrated proficiency in addressing tasks that necessitate a combination of task planning and the usage of external tools, such as weather and calculator APIs. However, real-world complex systems present three prevalent challenges concerning task planning and tool usage: (1) The real system usually has numerous APIs, so it is impractical to feed the descriptions of all APIs to the prompt of LLMs as the token length is limited; (2) the real system is designed for handling complex tasks, and the base LLMs can hardly plan a correct sub-task order and API-calling order for such tasks; (3) Similar semantics and functionalities among APIs in real systems create challenges for both LLMs and even humans in distinguishing between them. In response, this paper introduces a comprehensive framework aimed at enhancing the Task Planning and Tool Usage (TPTU) abilities of LLM-based agents within real-world systems. Our framework comprises three key components designed to address these challenges: (1) the API Retriever selects the most pertinent APIs among the extensive API set; (2) LLM Finetuner tunes a base LLM to enhance its capability for task planning and API calling; (3) the Demo Selector retrieves demonstrations related to hard-to-distinguish APIs, which is further used for in-context learning to boost the final performance. We validate our methods using a real-world industry system and an open-sourced academic dataset, demonstrating the efficacy of each individual component as well as the integrated framework.

PDF ICLRW OpenReview Semantic Scholar

Cite

Text

Kong et al. "TPTU-V2: Boosting Task Planning and Tool Usage of Large Language Model-Based Agents in Real-World Systems." ICLR 2024 Workshops: LLMAgents, 2024.

Markdown

[Kong et al. "TPTU-V2: Boosting Task Planning and Tool Usage of Large Language Model-Based Agents in Real-World Systems." ICLR 2024 Workshops: LLMAgents, 2024.](https://mlanthology.org/iclrw/2024/kong2024iclrw-tptuv2/)

BibTeX

@inproceedings{kong2024iclrw-tptuv2,
  title     = {{TPTU-V2: Boosting Task Planning and Tool Usage of Large Language Model-Based Agents in Real-World Systems}},
  author    = {Kong, Yilun and Ruan, Jingqing and Chen, YiHong and Zhang, Bin and Bao, Tianpeng and Shiwei, Shi and Qing, Du Guo and Hu, Xiaoru and Mao, Hangyu and Li, Ziyue and Zeng, Xingyu and Zhao, Rui and Wang, Xueqian},
  booktitle = {ICLR 2024 Workshops: LLMAgents},
  year      = {2024},
  url       = {https://mlanthology.org/iclrw/2024/kong2024iclrw-tptuv2/}
}