<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>13</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Seamus Somerstep</style></author><author><style face="normal" font="default" size="100%">Felipe Maia Polo</style></author><author><style face="normal" font="default" size="100%">Allysson Flavio Melo de Oliveira</style></author><author><style face="normal" font="default" size="100%">Prattyush Mangal</style></author><author><style face="normal" font="default" size="100%">Mírian Silva</style></author><author><style face="normal" font="default" size="100%">Onkar Bhardwaj</style></author><author><style face="normal" font="default" size="100%">Mikhail Yurochkin</style></author><author><style face="normal" font="default" size="100%">Subha Maity</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">CARROT: A Cost Aware Rate Optimal Router</style></title></titles><dates><year><style face="normal" font="default" size="100%">2025</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://arxiv.org/abs/2502.03261</style></url></web-urls></urls><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">With the rapid growth in the number of Large Language Models (LLMs), there has been recent interest in LLM routing, or directing queries to the cheapest LLM that can deliver a suitable response. Following this line of work, we introduce CARROT, a Cost AwaRe Rate Optimal rouTer that can select models based on any desired trade-off between performance and cost. Given a query, CARROT selects a model based on estimates of models' cost and performance.
Its simplicity lends CARROT computational efficiency, while our theoretical analysis demonstrates minimax rate-optimality in its routing performance. Alongside CARROT, we also introduce the Smart Price-aware Routing (SPROUT) dataset to facilitate routing on a wide spectrum of queries with the latest state-of-the-art LLMs. Using SPROUT and prior benchmarks such as Routerbench and open-LLM-leaderboard-v2, we empirically validate CARROT's performance against several alternative routers.</style></abstract></record></records></xml>