TaskCraft: Automated Generation of Agentic Tasks

Abstract

Agentic tasks, which require multistep problem solving with tool use and adaptive reasoning, are becoming increasingly central to the advancement of NLP and AI. Although benchmarks such as GAIA and BrowseComp have advanced agent evaluation, their scalability remains limited by the high cost of human annotation. We introduce TaskCraft, the first automated workflow for generating scalable, multitool, and verifiable agentic tasks of difficulty. TaskCraft progressively complexifies atomic tasks through depth-based and width-based extensions, with incremental validation via rejection sampling and LLM-based linguistic analysis, ensuring both scalability and efficiency. The generated tasks enable trajectory sampling within state-of-the-art workflows, supporting end-to-end SFT and RL training. Experimental results on multiple LLMs show that TaskCraft data substantially improves multi-hop reasoning and agentic capabilities. Further scaling with TaskCraft tasks and applying RL training yields additional gains, achieving state-of-the-art performance on four agentic benchmarks. The resulting dataset comprises 41k tool-intensive tasks across varied difficulty levels, including 12.6k tool-interaction trajectories and 5k multihop decompositions.

Cite

Text

Shi et al. "TaskCraft: Automated Generation of Agentic Tasks." International Conference on Learning Representations, 2026.

Markdown

[Shi et al. "TaskCraft: Automated Generation of Agentic Tasks." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/shi2026iclr-taskcraft/)

BibTeX

@inproceedings{shi2026iclr-taskcraft,
  title     = {{TaskCraft: Automated Generation of Agentic Tasks}},
  author    = {Shi, Dingfeng and Cao, Jingyi and Chen, Qianben and Sun, Weichen and Li, Weizhen and Lu, Hongxuan and Dong, Fangchen and Qin, Tianrui and Zhu, King and Liu, Minghao and Jiang, Yuchen Eleanor and Yang, Jian and Zhang, Ge and Liu, Jiaheng and Zhang, Changwang and Wang, Jun and Zhou, Wangchunshu},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/shi2026iclr-taskcraft/}
}