CARROT: A Cost Aware Rate Optimal Router

Abstract

With the rapid growth in the number of Large Language Models (LLMs), there has been a recent interest in *LLM routing*, or directing queries to the cheapest LLM that can deliver a suitable response. Following this line of work, we introduce CARROT, a Cost AwaRe Rate Optimal rouTer that can select models based on any desired trade-off between performance and cost. Given a query, CARROT selects a model based on estimates of models' cost and performance. Its simplicity lends CARROT computational efficiency, while our theoretical analysis demonstrates minimax rate-optimality in its routing performance. Alongside CARROT, we also introduce the Smart Price-aware Routing (SPROUT) dataset to facilitate routing on a wide spectrum of queries with the latest state-of-the-art LLMs. Using SPROUT and prior benchmarks such as Routerbench and open-LLM-leaderboard-v2 we empirically validate CARROT's performance against several alternative routers.

Cite

Text

Somerstep et al. "CARROT: A Cost Aware Rate Optimal Router." ICLR 2025 Workshops: FM-Wild, 2025.

Markdown

[Somerstep et al. "CARROT: A Cost Aware Rate Optimal Router." ICLR 2025 Workshops: FM-Wild, 2025.](https://mlanthology.org/iclrw/2025/somerstep2025iclrw-carrot/)

BibTeX

@inproceedings{somerstep2025iclrw-carrot,
  title     = {{CARROT: A Cost Aware Rate Optimal Router}},
  author    = {Somerstep, Seamus and Polo, Felipe Maia and de Oliveira, Allysson Flavio Melo and Mangal, Prattyush and Silva, Mírian and Bhardwaj, Onkar and Yurochkin, Mikhail and Maity, Subha},
  booktitle = {ICLR 2025 Workshops: FM-Wild},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/somerstep2025iclrw-carrot/}
}