Show HN: LLMuxer – Find the cheapest LLM that keeps your accuracy

2•mihir_ahuja•5mo ago

I built LLMuxer because I kept defaulting to GPT-4o for everything, even simple tasks where a smaller, cheaper model would have done just fine.

It runs your prompts or dataset (currently for classification tasks) across multiple models, compares performance vs. cost, and recommends the best value so you’re not wasting tokens or budget.

Would love feedback and ideas for features you’d want before using it in your own workflow!

Comments

mihir_ahuja•5mo ago

!pip install llmuxer llmuxer run --dataset banking77_test.json --models openai:gpt-4o-mini openai:gpt-4o claude:claude-3-5

mihir_ahuja•5mo ago

LLMuxer automates cost–accuracy trade-offs across LLM providers through OpenRouter, so you can easily benchmark and compare dozens of models with one API key.

Google in Your Terminal

Shannon: Claude Code for Pen Testing

Anthropic: Latest Claude model finds more than 500 vulnerabilities

Brooklyn cemetery plans human composting option, stirring interest and debate

Why the 'Strivers' Are Right

Brain Dumps as a Literary Form

Agentic Coding and the Problem of Oracles

Malicious packages for dYdX cryptocurrency exchange empties user wallets

Show HN: I built a <400ms latency voice agent that runs on a 4gb vram GTX 1650"

Penisgate erupts at Olympics; scandal exposes risks of bulking your bulge

Arcan Explained: A browser for different webs

What did we learn from the AI Village in 2025?

An open replacement for the IBM 3174 Establishment Controller

The P in PGP isn't for pain: encrypting emails in the browser

Show HN: Mirror Parliament where users vote on top of politicians and draft laws

Ask HN: Opus 4.6 ignoring instructions, how to use 4.5 in Claude Code instead?

We Mourn Our Craft

Jim Fan calls pixels the ultimate motor controller

Exploring a Modern SMTPE 2110 Broadcast Truck with My Dad

AI UX Playground: Real-world examples of AI interaction design

The Field Guide to Design Futures

The Other Leverage in Software and AI

AUR malware scanner written in Rust

Free FFmpeg API [video]

Are AI agents ready for the workplace? A new benchmark raises doubts

Show HN: AI Watermark and Stego Scanner

Clarity vs. complexity: the invisible work of subtraction

Solid-State Freezer Needs No Refrigerants

Ask HN: Will LLMs/AI Decrease Human Intelligence and Make Expertise a Commodity?

From Zero to Hero: A Brief Introduction to Spring Boot