Bolaa: Benchmarking and Orchestrating LLM Autonomous Agents
Abstract
The massive successes of large language models (LLMs) encourage the emerging exploration of LLM-augmented Autonomous Agents (LAAs). An LAA is able to generate actions with its core LLM and interact with environments, which facilitates the ability to resolve complex tasks by conditioning on past interactions such as observations and actions. Since the investigation of LAA is still very recent, limited explorations are available. Therefore, we provide a comprehensive comparison of LAA in terms of both agent architectures and LLM backbones. Additionally, we propose a new strategy to orchestrate multiple agents such that each agent focuses on one specific tasks, i.e. BOLAA, where a controller manages the communication among multiple agents. We conduct simulations on both decision-making and multi-step reasoning environments, which comprehensively justify the capacity of LAAs. Our performance results provide quantitative suggestions for designing LAA architectures and the optimal choice of LLMs, as well as the compatibility of both.
Cite
Text
Liu et al. "Bolaa: Benchmarking and Orchestrating LLM Autonomous Agents." ICLR 2024 Workshops: LLMAgents, 2024.Markdown
[Liu et al. "Bolaa: Benchmarking and Orchestrating LLM Autonomous Agents." ICLR 2024 Workshops: LLMAgents, 2024.](https://mlanthology.org/iclrw/2024/liu2024iclrw-bolaa/)BibTeX
@inproceedings{liu2024iclrw-bolaa,
title = {{Bolaa: Benchmarking and Orchestrating LLM Autonomous Agents}},
author = {Liu, Zhiwei and Yao, Weiran and Zhang, Jianguo and Xue, Le and Heinecke, Shelby and Rithesh, R N and Feng, Yihao and Chen, Zeyuan and Niebles, Juan Carlos and Arpit, Devansh and Xu, Ran and Mui, Phil L and Wang, Huan and Xiong, Caiming and Savarese, Silvio},
booktitle = {ICLR 2024 Workshops: LLMAgents},
year = {2024},
url = {https://mlanthology.org/iclrw/2024/liu2024iclrw-bolaa/}
}