frontpage.

Hey HN,

I built CostRouter because I noticed 70-80% of our AI API calls didn't need GPT-4o/5. Simple text extraction, basic Q&A, formatting — all going to the most expensive model.

CostRouter is an API gateway that scores each request's complexity (0-100) and routes it to the cheapest model that can handle it:

- Simple queries → Llama 4 Scout ($0.0001/1K tokens) - Medium → Gemini 3 Flash ($0.0005/1K tokens) - Complex reasoning → stays on GPT-5.2 or Claude Opus

Integration is one line — just change your base_url:

    client = OpenAI(
        api_key="crx_live_...",
        base_url="https://cost-router-alex10020s-projects.vercel.app/api/v1"
    )

Pricing: we charge 10% of what we save you. If we save you $0, you pay $0.

In a 100K request/month test: - Before: $3,127/mo (all GPT-5.2) - After: $1,245/mo - Net savings after our fee: $1,694/mo

Tech stack: Next.js, Supabase, Vercel. The routing engine scores prompt complexity based on length, keyword analysis, and structural patterns, then maps to the cheapest model above the quality threshold.

Would love feedback on the routing approach and pricing model.

https://cost-router-alex10020s-projects.vercel.app

FlowViz – A free, zero-login Mermaid diagram editor

British tourist among 20 charged in Dubai over videos of Iranian missile strikes

Mapping production AI agents to IAM roles, tools, and network exposure

Show HN: Slop or not – can you tell AI writing from human in everyday contexts?

Verified orchestration and cost tracking for Copilot CLI

Theremin Schematics

Straightforward descriptions of cybersecurity products. You're welcome

Is the sky falling for international enrollment?

Show HN: I've just launched my own API

How to build a sharable Claude Code agent with skills

Perlsky Is a Perl 5 Implementation of an at Protocol Personal Data Server

Show HN: Push-to-talk dictation for Android apps and terminal workflows

A.I. Incites a New Wave of Grieving Parents Fighting for Online Safety

CrackArmor: Multiple Vulnerabilities in AppArmor

Does Where You're Born Matter More Than How Hard You Work?

Show HN: OpenClaw-class agents on ESP32 (and the IDE that makes it possible)

Show HN: Turkish Sieve Engine – Full Prime Statistics Up to 10^14 and V2 Preview

Faster Bundler

Big Pork attacks California law on caging

A DOGE bro left Social Security with 500M records on a drive and expected pardon

How to Run Local LLMs with Claude Code (Unsloth)

AI assistants now equal 56% of global search engine volume

What is the strongest open source model for coding against Opus 4.6?

Whole-Brain Connectomic Graph Model Enables Whole-Body Locomotion Control in Fly

Patience – 3Sec Hold Game:)

Show HN: Homecastr - AI home price forecasts on a map

Show HN: DevNode.studio, 100% local dev tools to make back end work faster

Brex tests agents: by committing fraud

Cryo FAQ

AI Slop: A Slack API Rate Limiting Disaster

CostRouter – Cut AI API costs 60% by routing to the cheapest capable model