frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Slash LLM API Costs with This Open-Source Gateway

https://twitter.com/realedgelab/status/2022351000989012015
1•mylxsw•1h ago

Comments

mylxsw•1h ago
Just open-sourced Squirrel — an LLM API Gateway built to solve the nightmare of managing multiple models, providers, and prompts across different projects.

If you are building AI apps, managing agents, or running backend services, you have probably hit these walls:

Upgrading models is a grind. Updating hardcoded strings across 10+ repositories takes too much time.

Bleeding money blindly. Provider prices fluctuate, and tracking costs across multiple vendors is impossible manually.

Debugging is pure guesswork. Without full request/response logs, fixing broken prompts is a shot in the dark.

I built Squirrel to fix exactly this. Here is what it does out of the box:

Model Mapping (Change once, apply everywhere) Stop hardcoding specific models like gpt-4o or claude-3.5-sonnet. Map a virtual name (like my-smart) to a provider in the gateway. Want to upgrade all your apps at once? Just update the mapping in Squirrel. It takes effect instantly across all projects with zero code changes required.

Cost-Based Auto-Routing Set your provider pricing, and Squirrel automatically routes requests to the cheapest available option. It also supports priority, weight-based, and round-robin strategies.

Complete Observability Logs every single API call, including streaming responses. Check the admin dashboard to see the exact prompt sent, the model's output, token usage, and Time to First Byte (TTFB). This is an absolute lifesaver for debugging and fine-tuning.

Auto-Retry & Failover If a provider throws a 500 error or times out, Squirrel seamlessly switches to a backup provider. Your client-side code does not need to handle a thing.

Protocol Compatibility Works natively with OpenAI and Anthropic SDKs, and auto-translates protocols between them under the hood.

Tech Stack: Python (FastAPI) + Next.js + PostgreSQL/SQLite. One-click deployment via Docker Compose.

Fully open-source under the MIT license. It is still under active development, so feedback, issues, and PRs are incredibly welcome.

Check it out here: https://github.com/mylxsw/llm-gateway

WebMCP is available for early preview

https://developer.chrome.com/blog/webmcp-epp
1•mywacaday•48s ago•0 comments

A Brief History of Xenopus

https://www.asimov.press/p/xenopus
1•surprisetalk•2m ago•0 comments

Rethinking High-School Science Fairs

https://asteriskmag.com/issues/13/rethinking-high-school-science-fairs
1•surprisetalk•2m ago•0 comments

Memory Safety Is

https://matklad.github.io/2025/12/30/memory-safety-is.html
1•surprisetalk•2m ago•0 comments

Show HN: A private, bulk audio converter using WASM (186x real-time speed)

https://vocalremover.dev/bulk-converter/
1•jacoka•2m ago•1 comments

Your company is not a filesystem

https://twitter.com/anvishapai/status/2022062725354967551
1•anvisha•3m ago•0 comments

The Sharp PC-2000 Computer Boombox from 1979

https://stereo2go.com/forums/threads/one-of-the-rarest-the-sharp-pc-2000-computer-boombox-from-19...
1•coloneltcb•4m ago•0 comments

How The Times Is Digging Into Millions of Pages of Epstein Files

https://www.nytimes.com/2026/02/12/insider/jeffrey-epstein-files-documents.html
1•thm•7m ago•0 comments

Show HN: DotNetDevs – a reverse job board for .NET developers

https://dotnetdevs.net/
1•Nannooskeeska•7m ago•0 comments

Rustbridge: A framework for Rust libraries callable from other languages

https://github.com/jrobhoward/rustbridge
1•amatheus•7m ago•0 comments

Microsoft AI chief: 18 months for all white-collar work to be automated

https://fortune.com/2026/02/13/when-will-ai-kill-white-collar-office-jobs-18-months-microsoft-mus...
2•geox•8m ago•4 comments

The Final Bottleneck

https://lucumr.pocoo.org/2026/2/13/the-final-bottleneck/
1•coloneltcb•8m ago•0 comments

MSP Pentesting Using AI as a Service?

https://www.msppentesting.com/automated-and-ai-pentesting
1•0xcady•9m ago•1 comments

French police arrest nine people over suspected €10M Louvre ticket fraud

https://www.theguardian.com/world/2026/feb/13/french-police-arrest-people-suspected-louvre-ticket...
1•stevekemp•9m ago•0 comments

Free Soundboard for Twitch and Kick Streamers in 2026

https://killervibe.app/blog/best-soundboard-for-twitch-streamers-2026
1•Jikouken•11m ago•1 comments

SkyRL Brings Tinker to Your GPUs

https://novasky-ai.notion.site/skyrl-tinker
1•robertnishihara•13m ago•0 comments

Go! The AI Personal Operating System for Your Life

https://app.gosmartchain.ai/
1•Anthony_Go•14m ago•1 comments

code-simplifier.md

https://github.com/getsentry/skills/blob/main/plugins/sentry-skills/agents/code-simplifier.md
1•tosh•16m ago•0 comments

An API for Your Brain

https://arxiv.org/abs/2602.11632
1•bwjx•17m ago•0 comments

The consequences of task switching in supervisory programming

https://martinfowler.com/fragments/2026-02-13.html
2•bigwheels•20m ago•0 comments

'A game-changer': UC San Diego professor initiates new field of medical science

https://www.sandiegouniontribune.com/2026/02/12/a-game-changer-uc-san-diego-professor-initiates-a...
1•mikhael•21m ago•0 comments

Show HN: Alarm Arcade – an alarm app that only stops after you beat a mini-game

https://apps.apple.com/us/app/alarm-arcade-beat-the-clock/id6758615211
1•metehankasap•22m ago•0 comments

Expedition 33 art book confiscated because officials think it's an ancient relic

https://www.polygon.com/expedition-33-art-book-monolith-set-customs-ancient-artifact/
1•croes•22m ago•0 comments

Adversarial Patch: images that make classifiers ignore other items in a scene

https://arxiv.org/abs/1712.09665
1•felineflock•23m ago•0 comments

Opinion: NATO Has Seen the Future and Is Unprepared

https://www.wsj.com/opinion/nato-has-seen-the-future-and-is-unprepared-887eaf0f
1•alecco•24m ago•1 comments

Putin Didn't Know How Good He Had It

https://www.theatlantic.com/international/2026/02/putin-trump-world-order/685988/
4•JumpCrisscross•25m ago•1 comments

Elon vs. MongoDB- feature by feature comparison

https://www.devtoolsacademy.com/blog/eloqdoc-vs-mongodb-feature-comparison/
1•alokDT•26m ago•0 comments

Postgres Locks Explained: From Theory to Advanced Troubleshooting

https://postgreslocksexplained.com/
3•theotherbrian1•26m ago•1 comments

Evolving Git for the Next Decade

https://lwn.net/SubscriberLink/1057561/bddc1e61152fadf6/
2•todsacerdoti•28m ago•1 comments

Show HN: I built an AI that generates love proposals as pitch decks

https://aiproposal.fun/
1•pruthviraj900•28m ago•0 comments