Show HN: Nexus Gateway – Reduce LLM API Costs Using Semantic Caching

https://www.nexus-gateway.org/
2•Sunnyanand_dev•1h ago
Hi HN,

I'm building Nexus Gateway, an AI gateway that helps developers reduce LLM API costs.

Problem: Many applications send repeated or semantically similar prompts to LLMs, which leads to unnecessary API calls and higher costs.

Solution: Nexus Gateway uses semantic caching to detect similar prompts and serve cached responses instead of calling the LLM again.

Features:
• Semantic caching to reduce repeated API calls
• Multi-model support (OpenAI, Gemini, Llama, Anthropic)
• BYOK support
• PII protection and sovereign AI layer (in progress)

Goal: Reduce LLM costs by 40–70% while improving latency.
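
A rough way to sanity-check a savings target like this is to relate it to the cache hit rate. The sketch below is a back-of-the-envelope model, not a measurement from Nexus Gateway: it assumes every request pays a small lookup cost (embedding plus similarity search) and only cache misses pay the full LLM price. The function names and the prices in the usage lines are hypothetical.

```python
def estimated_savings(hit_rate, llm_cost_per_call, lookup_cost_per_call):
    """Fraction of spend saved versus sending every request to the LLM.

    Assumes every request pays the lookup cost and only misses pay the LLM cost.
    """
    with_cache = (1 - hit_rate) * llm_cost_per_call + lookup_cost_per_call
    return 1 - with_cache / llm_cost_per_call


def required_hit_rate(target_savings, llm_cost_per_call, lookup_cost_per_call):
    """Hit rate needed to reach a target savings fraction under the same model."""
    return target_savings + lookup_cost_per_call / llm_cost_per_call


# Hypothetical prices: $0.01 per LLM call, $0.0005 per cache lookup.
print(round(estimated_savings(0.5, 0.01, 0.0005), 2))   # about 0.45
print(round(required_hit_rate(0.4, 0.01, 0.0005), 2))   # about 0.45
```

Under this model the savings fraction is simply the hit rate minus the ratio of lookup cost to LLM cost, so a 40–70% reduction needs a hit rate slightly above 40–70%.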

I’d really appreciate feedback from the community.

Website: https://www.nexus-gateway.org

Comments

Sunnyanand_dev•1h ago
Hi everyone — I’m the founder of Nexus Gateway.

I built this because I noticed that many AI applications repeatedly send very similar prompts to LLM APIs. That means developers end up paying for the same reasoning multiple times.

Nexus Gateway tries to solve this using semantic caching. Instead of only checking for exact prompt matches, it detects semantically similar prompts and can serve cached responses when appropriate.
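
To make the idea concrete, here is a minimal sketch of a semantic cache. It is an illustration only, not Nexus Gateway's actual implementation. The names `embed`, `SemanticCache`, and `answer` are hypothetical, and a toy bag-of-words vector stands in for a real embedding model, so this version only catches lexical overlap rather than true semantic similarity.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class SemanticCache:
    def __init__(self, threshold=0.8):
        self.threshold = threshold  # minimum similarity to count as a hit
        self.entries = []           # list of (embedding, response) pairs

    def lookup(self, prompt):
        """Return the response stored for the most similar prompt, or None."""
        query = embed(prompt)
        best_score, best_response = 0.0, None
        for vec, response in self.entries:
            score = cosine(query, vec)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def store(self, prompt, response):
        self.entries.append((embed(prompt), response))


def answer(prompt, cache, call_llm):
    """Serve from the cache when possible; otherwise call the LLM and store it."""
    cached = cache.lookup(prompt)
    if cached is not None:
        return cached, True
    response = call_llm(prompt)
    cache.store(prompt, response)
    return response, False
```

With a fake `call_llm` function, asking "What is the capital of France?" and then "what is the capital of france" triggers only one upstream call, while an unrelated prompt is a miss. A production version would swap the linear scan for a vector index and would likely scope cache entries per model and per system prompt.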

Current features include:
• Multi-model support (OpenAI, Gemini, Anthropic, Llama)
• BYOK (Bring Your Own Key)
• Semantic caching to reduce repeated API calls
• Model routing

I'm currently also working on:
• PII protection layers
• Sovereign AI support for regulated industries like banks and hospitals

My goal is to build an infrastructure layer that helps teams reduce LLM costs and improve latency without changing much of their existing code.

I’d love feedback from the community, especially around:
• semantic caching strategies
• similarity thresholds
• enterprise security requirements
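
On similarity thresholds specifically, the trade-off can be sketched with a small threshold sweep. Everything here is an assumption for illustration: the prompt pairs and their labels are hand-written examples, not real traffic, and token-level Jaccard similarity stands in for whatever embedding similarity a real gateway would use. The function names are hypothetical.

```python
def jaccard(a, b):
    """Token-set overlap between two prompts, in [0, 1]."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


# (cached prompt, new prompt, whether the cached answer is actually valid)
PAIRS = [
    ("reset my password", "how do I reset my password", True),
    ("what is the capital of France", "what is the capital of Germany", False),
    ("convert 10 usd to eur", "convert 10 usd to eur please", True),
    ("summarize this article", "translate this article", False),
]


def evaluate(pairs, threshold):
    """Count false hits (wrong answer served) and missed hits (savings lost)."""
    false_hits = missed_hits = 0
    for cached, new, same_answer in pairs:
        hit = jaccard(cached, new) >= threshold
        if hit and not same_answer:
            false_hits += 1
        elif not hit and same_answer:
            missed_hits += 1
    return false_hits, missed_hits


for threshold in (0.4, 0.6, 0.8, 0.9):
    print(threshold, evaluate(PAIRS, threshold))
```

On these pairs, a low threshold serves wrong answers (the France and Germany prompts overlap heavily), while a high threshold gives up valid hits. A real evaluation would run the same kind of sweep over labeled production prompts to pick a threshold per use case.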

Happy to answer any technical questions.
