We’re building https://www.switchpoint.dev – a drop-in replacement for OpenAI’s API that cuts LLM costs by smartly routing each request across models (e.g., Claude, Gemini, GPT-4) based on the subject and difficulty of the task.
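To make the “drop-in” claim concrete, here’s a rough sketch of what a request looks like: the body follows the OpenAI Chat Completions schema, so existing client code should only need the base URL and API key swapped. The base URL and the `"auto"` model name below are illustrative placeholders, not our documented values – check the site for the real ones.

```python
import json
import urllib.request

# Hypothetical base URL -- see the switchpoint.dev docs for the real one.
BASE_URL = "https://api.switchpoint.dev/v1"

def build_chat_request(api_key: str, messages: list[dict]) -> urllib.request.Request:
    """Build an OpenAI-style chat-completions request aimed at the router.

    Because the body matches the OpenAI Chat Completions schema, existing
    client code only needs the base URL (and key) swapped.
    """
    body = json.dumps({"model": "auto", "messages": messages}).encode()
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request("sk-...", [{"role": "user", "content": "Hello"}])
print(req.full_url)
```

The same swap works with the official OpenAI SDKs by pointing their `base_url` option at the router instead of api.openai.com.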
Why we built this: LLM costs are spiraling, especially for products doing retrieval, agentic reasoning, or even just high-volume chat. We were frustrated with paying GPT-4 rates when most queries didn’t need them. So we built a router that:
- Starts with cheaper/free models (like Llama 8B, GPT-4o mini, Gemini 2.0 Flash)
- Streams responses and upgrades to a stronger model on failure
- Acts as a single OpenAI-compatible endpoint

For enterprise, you can define fallback logic and custom routing rules as well. It’s plug-and-play.
Designed for agents and RAG systems: We’re exploring integrations with open-source coding agents and autonomous frameworks. If you maintain or use one, we’d love to collaborate, or even just hear whether this would help your stack.
We’d love any and all feedback: feature requests, bugs, edge cases, your use cases. Would this be useful to you? Is it solving a real problem?
Thanks for checking it out!