frontpage.

Hello HN!

I’ve had various experiments/lightweight projects that make occasional calls to various providers, and just wanted a very simple and configurable way to automatically triage the models for slightly longer running tasks.

Cascade is a super simple, single dependency-free Python script that turns free-tier AI API keys into one always-on chat endpoint.

It takes any combination of free provider keys (Groq, Cerebras, Gemini, Mistral, OpenRouter, Cloudflare, SambaNova, Nvidia NIM, etc.) and Cascade automatically:

- Discovers available models - Ranks them best → worst - Routes prompts to the best available model - Detects rate limits, quota exhaustion, outages, and unsupported models - Fails over to the next-best model automatically

It works as both:

- An interactive CLI chat client - An OpenAI-compatible REST API (`/v1/chat/completions`)

Supports ~18 OpenAI-compatible providers today, including a keyless OVHcloud fallback. Run `/providers` to see connected providers and add more keys at any time.

I’m sure there’s solutions out there (incl. some of the providers listed), this isn’t a product; just a very simple solve for a very simple issue, sharing for those like myself looking for a more CLI-focused or configurable approach!

Nvidia Halos

All time best of Split Depth GIFs

Show HN: A voxel editor for decorating a home for a Tamagotchi-like creature

Two Singaporean brothers turns unsolvable math into post-quantum encryption

Show HN: Ziex, a Zig web framework reaching its first release

Moebius: 0.2B image inpainting model with 10B-level performance

Trump unveiled Qatar's gifted Air Force One this week

Tesla driver says it was on Autopilot before fatal Texas home crash

AI Is Boosting Productivity at Home – But Not Equally

Alan Greenspan's Essay: "Gold and Economic Freedom"

Ask HN: Is there still value in making apps?

AI effect: People are taking up skills for no money, just to feel human

China Became an Energy Superpower [video]

Robotic exoskeleton could redefine how stroke survivors relearn to walk

Polaris: A Native macOS App for Kamal

Hacker News Games in Videos

Extreme Dynamic Symmetry Enables Omnidirectional and Multifunctional Robots

Fungus-Growing Ants

AI will create more jobs for humans, not replace them, says Bezos

QSOE: QNX-inspired OS with dual-kernel architecture

Never Give Them Your Face

Former Unreal Engine 'Lead Evangelist' Sjoerd De Jong Leaves Epic Games

Show HN: GeoTag Photos – Add GPS coordinates to photos in the browser

Why European housing politics should be Americanized

Retrieval Debt: The Technical Debt Your Agent Is Paying

The Coming Enshittification of AI

Have Data Centers Raised Your Electric Bill? Causal Evidence from the US

DNA from 2k-year-old grape seeds points to origins of modern winemaking

Keogram: The Sky in 2025

Chevron signs 20-year power agreement with Microsoft for West Texas data center