Built for production: Rust backend, NVIDIA Triton integration, metrics for monitoring.
Flexible policies: Use built-in classifiers or plug in your own PyTorch models.
Easy integration: No major code changes needed; just point your OpenAI-compatible client at the router.
Example: Route complex coding questions to a powerful model, and simple rewrites to a smaller, cheaper one.
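To make the routing idea concrete, here is a toy sketch of that example policy in Python. This is not the blueprint's actual classifier (which can be a PyTorch model served via Triton); it is a trivial keyword heuristic, and the model names are hypothetical placeholders.

```python
# Toy illustration of task-based routing: pick a model per request
# based on a crude complexity check. Real deployments would swap this
# heuristic for a trained classifier.

def route(prompt: str) -> str:
    """Return a (hypothetical) model name for the given prompt."""
    complex_markers = ("implement", "debug", "refactor", "algorithm")
    if any(marker in prompt.lower() for marker in complex_markers):
        return "large-code-model"   # hypothetical powerful model
    return "small-fast-model"       # hypothetical cheaper model

print(route("Implement a lock-free queue in Rust"))   # large-code-model
print(route("Rewrite this sentence more politely"))   # small-fast-model
```

The router then forwards the request to whichever backend serves the selected model, so the client never has to know which LLM answered.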
Repo: github.com/NVIDIA-AI-Blueprints/llm-router
Would love feedback, ideas, and to hear how others are handling multi-LLM workflows!