- benchmarks routing cost-optimization
We ran 200 questions per model. Here is what we found.
Routerly routing policies matched Claude Sonnet 4.6 accuracy on MMLU and HumanEval while cutting costs by up to 69%.
Carlo Satta5 min read - announcement routing docker
Working toward Routerly v0.2.0
A short update on the Routerly v0.2.0 work: semantic-intent routing, version update visibility, dev installs, Docker build improvements, and bug fixing.
Carlo Satta5 min read - benchmarks routing cost-optimization
LLM routing policies work: what three benchmarks confirm
Three benchmarks validate LLM-based routing policies. Cost savings are confirmed on all tasks; the right success metric depends on the use case.
Carlo Satta7 min read - benchmarks routing performance
Measuring Routerly: MMLU, HumanEval, and BIRD Benchmarks
We published routerly-benchmark, an open suite that measures the impact of intelligent routing on quality, cost, and latency across three standard AI evaluation tasks. Here is how it works and what we found.
Carlo Satta4 min read - release v0.1.5 routing
Routerly v0.1.5: First Public Release
The first tagged release of Routerly ships 9 routing policies, multi-tenant project isolation, a built-in web dashboard, a full admin CLI, and a one-line installer for macOS, Linux, and Windows.
Carlo Satta4 min read