Show HN: I built CostLens SDK to cut my AI bills by routing to cheaper models

https://costlens.dev/

2•j_filipe•1h ago

My OpenAI bills were getting out of hand - I was using GPT-4 for everything, even simple tasks that GPT-3.5 could handle perfectly.

So I built CostLens. It's a drop-in replacement that automatically routes requests to cheaper models when possible, but falls back to premium ones when quality matters.

How it works:

  // Just swap this:
  const openai = new OpenAI({ apiKey: 'sk-...' });

  // For this:
  const costlens = new CostLens();
  const openai = costlens.openai({ apiKey: 'sk-...' });
  // Everything else stays exactly the same
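To make the routing idea concrete, here is a minimal sketch of how a wrapper like this could pick a model per request. It is not CostLens's actual logic: the `pickModel` and `withRouting` helpers, the length cutoff, and the keyword list are all assumptions made up for illustration. The only real API shape relied on is `chat.completions.create` from the OpenAI Node SDK.

```javascript
// Hypothetical routing heuristic; NOT CostLens's real implementation.
const CHEAP_MODEL = 'gpt-4o-mini';
const PREMIUM_MODEL = 'gpt-4';

// Assumed signals that a prompt probably needs the premium model.
const COMPLEX_HINTS = ['refactor', 'architecture', 'step by step', 'trade-off'];

// Assumes every message has plain string content.
function pickModel(messages, { maxCheapChars = 2000 } = {}) {
  const text = messages.map((m) => m.content).join('\n').toLowerCase();
  const looksComplex = COMPLEX_HINTS.some((hint) => text.includes(hint));
  return looksComplex || text.length > maxCheapChars ? PREMIUM_MODEL : CHEAP_MODEL;
}

// Wraps any client that exposes chat.completions.create and overrides
// the requested model with whatever pickModel chooses.
function withRouting(client) {
  return {
    chat: {
      completions: {
        create: (params) =>
          client.chat.completions.create({
            ...params,
            model: pickModel(params.messages),
          }),
      },
    },
  };
}
```

Because the wrapper keeps the same call shape, calling code would not need to change, which is the "drop-in" property described above.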

Real savings:
• Simple tasks: GPT-4 → GPT-4o-mini (95% cheaper)
• Complex tasks: Still uses GPT-4 when needed
• My bills dropped ~70% with zero code changes

Features:
• Quality detection (auto-retries with better models if response is bad)
• Works with existing code - no prompt changes needed
• Caching with Redis
• Instant mode (no signup required)
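For the caching feature, a rough sketch of the general idea: identical requests map to the same key, so a repeat call never reaches the API. Everything here is an assumption for illustration (the `cacheKey` and `createCachedCaller` names, and the choice of key fields); a plain `Map` stands in for Redis so the example runs without a server. With real Redis the lookups would be asynchronous.

```javascript
// Hypothetical response cache; a Map stands in for Redis here.
function cacheKey({ model, messages, temperature = 1 }) {
  // Same model + same messages + same temperature => same key.
  return JSON.stringify({ model, messages, temperature });
}

function createCachedCaller(callModel, store = new Map()) {
  return function cachedCall(params) {
    const key = cacheKey(params);
    if (store.has(key)) {
      // Cache hit: skip the API call entirely.
      return { ...store.get(key), cached: true };
    }
    const response = callModel(params);
    store.set(key, response);
    return { ...response, cached: false };
  };
}
```

Caching only pays off when prompts repeat exactly, so it complements model routing rather than replacing it.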

Try it: npm install costlens

The core SDK is free and works locally. I'm also building a dashboard for teams to track their AI spending.

NPM: https://www.npmjs.com/package/costlens

Anyone else tired of overpaying for AI APIs? What's your biggest cost pain point?

Comments

j_filipe•1h ago

Hey everyone!

I'm the dev behind this. Started as a weekend project because I kept getting sticker shock from my OpenAI bills. I'd use GPT-4 for literally everything - even "fix this typo" type requests that cost 20x more than they should.

The breakthrough was realizing most requests don't actually need the expensive models. So I built quality detection that tries the cheap model first, then upgrades only if the response is garbage.
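Here is a minimal sketch of that cheap-first pattern. The `scoreResponse` function is a toy stand-in invented for this example (it only checks for empty answers and refusals), and the model objects are stubs; the real quality detection in the SDK is not shown in this post and is certainly more involved.

```javascript
// Hypothetical cheap-first cascade; scoreResponse is a toy quality check.
function scoreResponse(text) {
  if (!text || text.trim().length === 0) return 0;
  const refused = /i (can't|cannot|am unable to)/i.test(text);
  return refused ? 0.2 : 1;
}

// `models` is ordered cheapest first; each entry is { name, call(prompt) }.
function cascade(prompt, models, { minScore = 0.5 } = {}) {
  let last;
  for (const model of models) {
    const text = model.call(prompt);
    last = { model: model.name, text, score: scoreResponse(text) };
    if (last.score >= minScore) return last; // good enough, stop here
  }
  // Every tier scored low; return the last (most capable) attempt.
  return last;
}
```

Note that an escalated request pays for both the cheap attempt and the premium one, so this only saves money when most requests pass on the first try.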

Been using it in production for 3 months now. Went from ~$400/month to ~$120/month with zero changes to my actual prompts or code. The quality detection escalates about 15-20% of requests to the premium models.
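A quick sanity check on those numbers. The first function uses only the figures quoted here. The second is a deliberately simplified model, and its assumptions are not from the SDK: every request costs the same on a given model, the cheap model is always tried first, and escalated requests pay for both attempts. It uses the 95% price gap quoted in the post.

```javascript
// Fraction saved when a bill drops from `before` to `after`.
function savingsFraction(before, after) {
  return (before - after) / before;
}

// Assumed toy model: cost relative to sending everything to the premium model.
// Every request pays the cheap price; escalated ones also pay the premium price (1).
function expectedRelativeCost(cheapPriceRatio, escalationRate) {
  return cheapPriceRatio + escalationRate;
}

console.log(savingsFraction(400, 120));             // 0.7
console.log(1 - expectedRelativeCost(0.05, 0.15));  // roughly 0.80
console.log(1 - expectedRelativeCost(0.05, 0.2));   // roughly 0.75
```

Under those assumptions the model predicts savings of roughly 75-80%, in the same ballpark as the observed ~70%. Real traffic is not uniform (requests that need escalating tend to be the longer, pricier ones), which would pull the real figure down.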

Works with both OpenAI and Anthropic - Claude Opus → Claude Haiku saves even more than the OpenAI routing since the price gap is bigger.

Happy to answer any questions! The trickiest part was getting the quality scoring right - too aggressive and you get bad responses, too conservative and you don't save money.
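That trade-off can be sketched as a single threshold on the quality score. The scores below are made-up toy values, not measurements, and `escalationRate` is a hypothetical helper: it just reports what fraction of cheap responses would be rejected and retried on a premium model at a given threshold.

```javascript
// Hypothetical helper: fraction of cheap-model responses that fall below
// the acceptance threshold and would be retried on a premium model.
function escalationRate(scores, threshold) {
  if (scores.length === 0) return 0;
  const rejected = scores.filter((score) => score < threshold).length;
  return rejected / scores.length;
}

// Toy scores for five cheap-model responses (invented for illustration).
const toyScores = [0.2, 0.4, 0.6, 0.8, 0.9];

for (const threshold of [0.1, 0.5, 0.85]) {
  console.log(threshold, escalationRate(toyScores, threshold));
}
```

A low threshold accepts nearly everything from the cheap model (maximum savings, more bad answers slip through), while a high threshold sends most requests to the premium model anyway (better answers, little saved).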

Also working on a team dashboard, but wanted to get the core SDK out there first since it's been so useful for me.
