I'm building Nexus Gateway, an AI gateway that helps developers reduce LLM API costs.
Problem: Many applications send repeated or semantically similar prompts to LLMs, which leads to unnecessary API calls and higher costs.
Solution: Nexus Gateway uses semantic caching to detect similar prompts and serve cached responses instead of calling the LLM again.
Features:
• Semantic caching to reduce repeated API calls
• Multi-model support (OpenAI, Gemini, Llama, Anthropic)
• BYOK support
• PII protection and sovereign AI layer (in progress)
Goal: Reduce LLM costs by 40–70% while improving latency.
I’d really appreciate feedback from the community.
Website: https://www.nexus-gateway.org
Sunnyanand_dev•1h ago
I built this because I noticed that many AI applications repeatedly send very similar prompts to LLM APIs, which means developers end up paying for the same reasoning multiple times.
Nexus Gateway tries to solve this using semantic caching. Instead of only checking for exact prompt matches, it detects semantically similar prompts and can serve cached responses when appropriate.
Current features include:
• Multi-model support (OpenAI, Gemini, Anthropic, Llama)
• BYOK (Bring Your Own Key)
• Semantic caching to reduce repeated API calls
• Model routing
I'm currently also working on:
• PII protection layers
• Sovereign AI support for regulated industries like banks and hospitals
My goal is to build an infrastructure layer that helps teams reduce LLM costs and improve latency without changing much of their existing code.
I'd love feedback from the community, especially around:
• semantic caching strategies
• similarity thresholds
• enterprise security requirements
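On the threshold question specifically, the core trade-off is easy to show with toy numbers (again using a bag-of-words stand-in for a real embedding; the prompts and thresholds below are invented for illustration):

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words embedding, standing in for a real model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

cached = embed("reset my account password")
paraphrase = embed("how do i reset my password")
unrelated = embed("cancel my subscription")

s_para = cosine(cached, paraphrase)   # moderate similarity
s_unrel = cosine(cached, unrelated)   # low, but nonzero (shares "my")

# A strict threshold (e.g. 0.9) rejects the paraphrase -> cache miss,
# paying for an LLM call that was arguably unnecessary.
strict_miss = s_para < 0.9

# A very loose threshold (e.g. 0.25) accepts the unrelated query ->
# a false hit that serves the wrong cached answer.
loose_false_hit = s_unrel >= 0.25
```

Strict thresholds protect correctness at the cost of hit rate; loose ones do the opposite, so the right value likely depends on the embedding model and how tolerant the application is of a near-match answer.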
Happy to answer any technical questions.