Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Gao, Shen; Shi, Zhengliang; Zhu, Minghang; Fang, Bowen; Xin, Xin; Ren, Pengjie; Chen, Zhumin; Ma, Jun; Ren, Zhaochun

doi:10.1609/AAAI.V38I16.29759

Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Shen Gao, Zhengliang Shi, Minghang Zhu, Bowen Fang, Xin Xin, Pengjie Ren, Zhumin Chen, Jun Ma, Zhaochun Ren

AAAI 2024 pp. 18030-18038

doi:10.1609/AAAI.V38I16.29759 /aaai/2024/gao2024aaai-confucius/

Abstract

Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extending the capability of LLMs. Although there are some works that employ open-source LLMs for the tool-learning task, most of them are trained in a controlled environment in which LLMs only learn to execute the human-provided tools. However, selecting proper tools from the large toolset is also a crucial ability for the tool-learning model to be applied in real-world applications. Existing methods usually directly employ self-instruction methods to train the model, which ignores differences in tool complexity. In this paper, we propose the Confucius a novel tool-learning framework to train LLM to use complicated tools in real-world scenarios, which contains two main phases: (1) We first propose a multi-stage learning method to teach the LLM to use various tools from an easy-to-difficult curriculum; (2) thenceforth, we propose the Iterative Self-instruct from Introspective Feedback (ISIF) to dynamically construct the dataset to improve the ability to use the complicated tool. Extensive experiments conducted on both controlled and real-world settings demonstrate the superiority of our tool-learning framework in the real-world application scenario compared to both tuning-free (e.g., ChatGPT, Claude) and tuning-based baselines (e.g., GPT4Tools).

PDF AAAI Semantic Scholar

Cite

Text

Gao et al. "Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I16.29759

Markdown

[Gao et al. "Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/gao2024aaai-confucius/) doi:10.1609/AAAI.V38I16.29759

BibTeX

@inproceedings{gao2024aaai-confucius,
  title     = {{Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum}},
  author    = {Gao, Shen and Shi, Zhengliang and Zhu, Minghang and Fang, Bowen and Xin, Xin and Ren, Pengjie and Chen, Zhumin and Ma, Jun and Ren, Zhaochun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {18030-18038},
  doi       = {10.1609/AAAI.V38I16.29759},
  url       = {https://mlanthology.org/aaai/2024/gao2024aaai-confucius/}
}