The Architecture of Existing: Why LLM Routing Isn’t About Cost
When we talk about LLM routing, the conversation usually goes like this: “GPT-4 is expensive. Local models are cheap. Let’s route simple queries to cheap models and save money.” This framing is wrong. Or at least, it misses the point. The Cost Frame The technical approach is sound. RouteLLM, a framework from LMSYS, demonstrates you…
