Mixture-of-Agents (MoA): Our /blend endpoint implements multi-layer MoA. You send a prompt to 2-6 models in parallel, then each model refines its answer using the other models' outputs as reference material. This runs for 1-3 configurable layers before a synthesizer model produces the final response. We also built a Self-MoA variant: a single model generates 2-8 diverse candidates using temperature variation and distinct agent prompts ("prioritize correctness", "anticipate edge cases", "be skeptical"), then synthesizes the best parts. Six blend strategies total: consensus, council, best_of, chain, moa, and self_moa.
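For the shape of it, here's a minimal sketch of the multi-layer moa strategy. This is not our production code: the prompts and function names are illustrative, and the real endpoint also streams, tracks cost, and handles failures. It assumes LiteLLM's async completion call.

```python
import asyncio
from litellm import acompletion

async def call_model(model: str, prompt: str) -> str:
    # One prompt to one model via LiteLLM's async API.
    resp = await acompletion(model=model, messages=[{"role": "user", "content": prompt}])
    return resp.choices[0].message.content

async def moa_blend(prompt: str, models: list[str], synthesizer: str, layers: int = 2) -> str:
    # Layer 1: every model answers the prompt independently, in parallel.
    answers = await asyncio.gather(*(call_model(m, prompt) for m in models))

    # Layers 2..n: each model refines its draft using the others' outputs as reference.
    for _ in range(layers - 1):
        tasks = []
        for i, model in enumerate(models):
            refs = "\n\n".join(a for j, a in enumerate(answers) if j != i)
            tasks.append(call_model(
                model,
                f"Question: {prompt}\n\nYour draft:\n{answers[i]}\n\n"
                f"Reference answers from other models:\n{refs}\n\n"
                "Revise your draft, borrowing anything useful from the references.",
            ))
        answers = await asyncio.gather(*tasks)

    # Final step: a synthesizer model merges the refined candidates into one answer.
    candidates = "\n\n---\n\n".join(answers)
    return await call_model(
        synthesizer,
        f"Question: {prompt}\n\nCandidate answers:\n{candidates}\n\n"
        "Synthesize the single best response.",
    )
```

Self-MoA is the same loop with one model, varied temperatures, and the different agent prompts standing in for different models.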
Circuit breakers: Every model has a health tracker with a classic closed → open → half-open state machine. Three consecutive failures trip the circuit for 30 seconds. When a model is down, mesh routing automatically skips it and tries the fallback chain, so no latency is wasted on providers that are having a bad day. The SSE stream emits route events so you can see exactly what happened: trying, failed, skipped(circuit_open), trying, success. OpenRouter gets its own tuned thresholds (6 consecutive 429s, 20s cooldown) because rate limits there behave differently from hard failures.
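Stripped of the per-provider tuning, the breaker itself is small. A minimal sketch of the state machine (the real one also classifies error types, e.g. counting OpenRouter 429s separately):

```python
import time

class CircuitBreaker:
    """Closed -> open -> half-open: `threshold` consecutive failures open the
    circuit for `cooldown` seconds; a success during half-open closes it."""

    def __init__(self, threshold: int = 3, cooldown: float = 30.0):
        self.threshold = threshold
        self.cooldown = cooldown
        self.failures = 0
        self.opened_at: float | None = None

    def allow_request(self) -> bool:
        if self.opened_at is None:
            return True  # closed: model is healthy
        if time.monotonic() - self.opened_at >= self.cooldown:
            return True  # half-open: allow probe traffic through
        return False     # open: routing skips this model entirely

    def record_success(self) -> None:
        self.failures = 0
        self.opened_at = None  # close the circuit

    def record_failure(self) -> None:
        self.failures += 1
        if self.failures >= self.threshold:
            self.opened_at = time.monotonic()  # trip open, (re)start the cooldown
```

Each model gets its own instance; the OpenRouter one would be constructed with threshold=6, cooldown=20.0.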
Auto-router: model: "auto" does zero-overhead heuristic routing (pure regex classification, no LLM call). Code goes to GPT, math/creative goes to Claude, translation goes to Gemini Flash, etc. Simple, fast, and surprisingly effective for common queries.
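A sketch of what that kind of classifier looks like; the patterns and model names here are illustrative, not the production routing table:

```python
import re

# First match wins; the default catches everything else.
ROUTES = [
    (re.compile(r"\b(def|class|function|import|traceback|segfault|compile)\b", re.I),
     "openai/gpt-4o"),
    (re.compile(r"\b(prove|integral|derivative|theorem|poem|story|essay)\b", re.I),
     "anthropic/claude-sonnet"),
    (re.compile(r"\btranslate\b|\bin (french|spanish|german|japanese)\b", re.I),
     "gemini/gemini-flash"),
]

def auto_route(prompt: str, default: str = "openai/gpt-4o-mini") -> str:
    """Resolve model="auto" with pure regex classification: no LLM call."""
    for pattern, model in ROUTES:
        if pattern.search(prompt):
            return model
    return default
```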
Other things that were fun to build:
- Credit settlement with margin targeting: we reserve credits upfront, then reconcile against actual provider cost after the response completes
- Per-user semantic memory via pgvector: conversations build retrievable context across sessions
- BYOK encryption (Fernet/AES-128) so you can bring your own API keys and skip our billing entirely (a minimal sketch follows this list)
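The BYOK piece is symmetric encryption at rest using the cryptography package's Fernet (AES-128-CBC plus HMAC-SHA256 under the hood). A minimal sketch; in production the master key would come from config or a KMS, not be generated per-process:

```python
from cryptography.fernet import Fernet

# Demo only: real deployments load the master key from config or a KMS.
master_key = Fernet.generate_key()
fernet = Fernet(master_key)

# Encrypt the user's provider key before it touches the database...
token = fernet.encrypt(b"sk-user-provided-api-key")

# ...and decrypt it only at request time, right before the provider call.
assert fernet.decrypt(token) == b"sk-user-provided-api-key"
```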
The whole backend is async Python (FastAPI + asyncpg + LiteLLM); the frontend is a static Next.js build served by the same FastAPI process in production. Single Docker image on Railway.
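Serving both from one process comes down to route-registration order: API routes first, then the static export mounted at the root. A sketch, assuming the Next.js build lands in out/:

```python
from fastapi import FastAPI
from fastapi.staticfiles import StaticFiles

app = FastAPI()

# API routes are registered first, so they take precedence over the mount...
@app.get("/api/health")
async def health() -> dict:
    return {"ok": True}

# ...then the static Next.js export is served for everything else.
app.mount("/", StaticFiles(directory="out", html=True), name="frontend")
```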
For the technically curious: https://llmwise.ai/llms-full.txt has the complete platform documentation in plain text, and there's also a machine-readable view at https://llmwise.ai/ai designed for AI agents to consume.