I built this because I kept running into two big headaches while scaling AI apps. First was the anxiety of accidentally leaking customer PII to OpenAI. Second was paying providers over and over for semantically identical queries.
I looked at existing routers, but I wanted something optimized for speed and strict privacy. So I built Sentinel Gateway using Go and Redis.
How it works:
Real-Time Trace Observability: Every request is logged in a central dashboard. You can see exactly what users are asking, which provider handled it, and confirm that PII was scrubbed in real time. It’s a black-box recorder for your LLM implementation.
Network-Level PII Guard: Instead of writing regex scrubbers in every app, you route through the gateway. It redacts SSNs, credit cards, and emails before the payload ever touches a public API.
Semantic Caching: A Redis layer checks for semantic similarity. If a user asks the same question, it serves the response in ~13ms and bypasses LLM costs entirely.
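The cache lookup presumably boils down to a nearest-neighbor check over query embeddings. A toy sketch, with an in-memory slice standing in for the Redis vector index and an assumed (not actual) similarity threshold:

```go
package main

import (
	"fmt"
	"math"
)

// entry pairs a stored query embedding with its cached LLM response.
type entry struct {
	embedding []float64
	response  string
}

// cosine computes cosine similarity between two equal-length vectors.
func cosine(a, b []float64) float64 {
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// lookup returns a cached response when a prior query is close enough.
// The 0.95 threshold is a placeholder tuning knob, not the real value.
func lookup(cache []entry, query []float64) (string, bool) {
	const threshold = 0.95
	for _, e := range cache {
		if cosine(e.embedding, query) >= threshold {
			return e.response, true
		}
	}
	return "", false
}

func main() {
	cache := []entry{{[]float64{0.9, 0.1, 0.4}, "cached answer"}}
	// A near-duplicate query skips the provider entirely.
	if resp, ok := lookup(cache, []float64{0.88, 0.12, 0.41}); ok {
		fmt.Println(resp)
	}
}
```

The threshold is exactly where the edge cases live: set it too loose and "refund policy for EU customers" can collide with "refund policy for US customers", which are near-identical in embedding space but must not share an answer.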
Multi-Provider Billing: It tracks token throughput and raw costs for OpenAI, Anthropic, Gemini, and Groq in one place.
I just finished a clean wipe of the database logs to get ready for launch. There is a free Hobby tier live now where you can test the PII redaction and watch the ~13ms cache hits in real time.
I am a solo dev and would love your honest feedback. What obvious edge cases am I missing in the caching logic?