Show HN: RAG Firewall – retrieval-time guardrails for LangChain/LlamaIndex

https://github.com/taladari/rag-firewall

1•talbuilds•5mo ago

RAG pipelines are great, but they can still retrieve "toxic" chunks: – prompt injection attempts – leaked API keys/secrets – stale or conflicting content – unapproved external URLs

We built an open-source "retrieval firewall" that scans chunks before they reach the LLM: – denies injection & secrets – flags/reranks PII, encoded blobs, untrusted URLs – audit log (JSONL) of all decisions – drop-in wrappers for LangChain and LlamaIndex retrievers

Install: pip install rag-firewall Repo: https://github.com/taladari/rag-firewall

Curious if others here handle retrieval-time risks, or just ingest/output filtering. Would love feedback and red-team payloads.

Comments

talbuilds•5mo ago

A couple of extra notes I didn’t fit in the main post:

– The firewall runs entirely client-side, so no data ever leaves your environment.

– It focuses on *retrieval-time* risks, not output moderation — so the LLM never sees poisoned chunks in the first place.

– Policies are YAML: you can choose to deny, allow, or just re-rank risky docs (based on recency, provenance, relevance).

– Overhead is low: scanners are regex/heuristic, so for ~5–20 chunks it adds only a few ms.

I’d love feedback on two things in particular:

1. Do you think retrieval-time filtering belongs in the pipeline, or should it all be done at ingest/output?

2. If you’ve got prompt injection payloads or edge cases you use to test your own RAG stacks, I’d love to try them against this.

Thanks for taking a look — always happy to hear critique, especially from folks running LangChain/LlamaIndex in production.

talbuilds•5mo ago

300+ installs in 24h, RAG Firewall now with GraphRAG support.

danshalev7•5mo ago

Very interesting

Interview with 'Just use a VPS' bro (OpenClaw version) [video]

EchoJEPA: Latent Predictive Foundation Model for Echocardiography

Disablling Go Telemetry

Effective Nihilism

The UK government didn't want you to see this report on ecosystem collapse

No 10 blocks report on impact of rainforest collapse on food prices

Seedance 2.0 Is Coming

Show HN: Fitspire – a simple 5-minute workout app for busy people (iOS)

Dexterous robotic hands: 2009 – 2014 – 2025

Interop 2025: A Year of Convergence

JobArena – Human Intuition vs. Artificial Intelligence

Concept Artists Say Generative AI References Only Make Their Jobs Harder

Show HN: PaySentry – Open-source control plane for AI agent payments

Show HN: Moli P2P – An ephemeral, serverless image gallery (Rust and WebRTC)

The Crumbling Workflow Moat: Aggregation Theory's Final Chapter

Pax Historia – User and AI powered gaming platform

Show HN: I built a RAG engine to search Singaporean laws

Scams, Fraud, and Fake Apps: How to Protect Your Money in a Mobile-First Economy

Porting Doom to My WebAssembly VM

Cognitive Style and Visual Attention in Multimodal Museum Exhibitions

Full-Blown Cross-Assembler in a Bash Script

Logic Puzzles: Why the Liar Is the Helpful One

Optical Combs Help Radio Telescopes Work Together

Show HN: Myanon – fast, deterministic MySQL dump anonymizer

The Tao of Programming

Forcing Rust: How Big Tech Lobbied the Government into a Language Mandate

PanelBench: We evaluated Cursor's Visual Editor on 89 test cases. 43 fail

Can You Draw Every Flag in PowerPoint? (Part 2) [video]

Show HN: MCP-baepsae – MCP server for iOS Simulator automation

Make Trust Irrelevant: A Gamer's Take on Agentic AI Safety