frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: FireClaw – Open-source proxy defending AI agents from prompt injection

https://github.com/raiph-ai/fireclaw
4•raiph_ai•2h ago
Hey HN,

We built FireClaw because we kept watching AI agents get owned by prompt injection through web content. The agent fetches a page, the page says "ignore previous instructions," and suddenly your agent is leaking data or running commands it shouldn't.

The existing solutions detect injection after the fact. We wanted to prevent it.

FireClaw is a security proxy that sits between your AI agent and the web. Every fetch passes through a 4-stage pipeline:

1. DNS blocklist check (URLhaus, PhishTank, community feed) 2. Structural sanitization (strip hidden CSS, zero-width Unicode, encoding tricks) 3. Isolated LLM summarization (hardened sub-process with no tools or memory) 4. Output scanning with canary tokens (detect if content bypassed summarization)

The key insight: even if Stage 3's LLM gets injected, it has no tools, no memory, and no access to your data. It can only return text — which still gets scanned in Stage 4. The attacker hits a dead end.

Other design decisions: - No bypass mode. The pipeline is fixed. If your agent gets compromised, it can't disable FireClaw. - Community threat feed — instances anonymously share detection metadata (domain, severity, detection count) to build a shared blocklist. No page content is ever sent. - Runs on a Raspberry Pi as a physical appliance with an OLED display that shows real-time stats and lights up with animated flames when it catches a threat.

We searched the literature and open source extensively — no one else is doing proxy-based defense for agent prompt injection. Detection exists, sandboxing exists, but an inline proxy that sanitizes before content reaches the agent's context? We couldn't find it.

200+ detection patterns, JSONL audit logging, domain trust tiers, rate limiting, and cost controls. AGPLv3 licensed.

Website: https://fireclaw.app

Would love feedback from anyone working on AI agent security. What are we missing? What attack vectors should we add to the pattern database?

Comments

raiph_ai•2h ago
Creator here. Quick TL;DR and some context:

FireClaw = prompt injection firewall for AI agents. Proxy architecture, not just detection. 4-stage pipeline, no bypass mode, community threat feed.

The thing that surprised us most during research: nobody is doing this. There are great pattern detectors (Rebuff, LLM Guard, etc.) but they all work post-hoc — the content has already entered the agent's context by the time you detect injection. FireClaw intercepts it before that happens.

The Pi appliance was honestly just for fun at first, but it turns out having a physical box with a screen showing "3 threats blocked today" is surprisingly reassuring. The OLED does an animated fire claw when it catches something.

Happy to answer any questions about the architecture, the canary token system, or the threat feed privacy model.

ucsandman•1h ago
this is cool, definitely going to look into it and probably try to integrate it with my opensource project. prompt injection keeps me up at night thanks for putting in some work trying to solve it.
Terr_•2h ago
I'm reminded of all the man-hours spent building layers that prohibited someone's "about me" field from containing words like "update" or "delete" or "truncate".

Sure, technically it reduced the the odds of the system getting hacked, but it rankles against some engineering ideal of "not a proper fix." Yet it still happens, because a "proper fix" involves some change to the underlying layer (RDBMS or LLM).

nikolas_sapa•2h ago
nice concept. open claw is very valuable so this will help solve that. also checked your landing page and love the attacking raccoon. one thing I would change though is remove the emojis and add icons. but great work

Show HN: Antfly: Distributed, Multimodal Search and Memory and Graphs in Go

https://github.com/antflydb/antfly
50•kingcauchy•3h ago•22 comments

Show HN: March Madness Bracket Challenge for AI Agents Only

https://www.Bracketmadness.ai
48•bwade818•6h ago•20 comments

Show HN: Crust – A CLI framework for TypeScript and Bun

https://github.com/chenxin-yan/crust
27•jellyotsiro•14h ago•12 comments

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

https://github.com/peters/horizon
3•petersunde•56m ago•0 comments

Show HN: Flowershow Publish Markdown in seconds. Hosted, free, zero config

https://flowershow.app/
5•rufuspollock•1h ago•0 comments

Show HN: Mech keyboard sounds driven by a hidden accelerometer in MacBooks

https://www.haptyk.com/
3•olvvier•2h ago•1 comments

Show HN: Oxyde – Pydantic-native async ORM with a Rust core

https://github.com/mr-fatalyst/oxyde
148•mr_Fatalyst•4d ago•73 comments

Show HN: Claude Code skills that build complete Godot games

https://github.com/htdt/godogen
279•htdt•1d ago•183 comments

Show HN: FireClaw – Open-source proxy defending AI agents from prompt injection

https://github.com/raiph-ai/fireclaw
4•raiph_ai•2h ago•4 comments

Show HN: Updated version of my interactive Middle-Earth map

https://github.com/Jean-Tinland/middle-earth/
2•jetin•2h ago•0 comments

Show HN: Thermal Receipt Printers – Markdown and Web UI

https://github.com/sadreck/ThermalMarky
109•howlett•4d ago•42 comments

Show HN: F0lkl0r3.dev – a searchable, interlinked map of computing history

https://f0lkl0r3.dev
2•dynamicwebpaige•3h ago•0 comments

Show HN: Unsloth Studio - Local Fine-tuning, Chat UI

https://github.com/unslothai/unsloth
6•danielhanchen•3h ago•0 comments

Show HN: Droeftoeter, a Terminal Coding Toy

https://github.com/whtspc/droeftoeter
30•whtspc64•4d ago•6 comments

Show HN: Zeroboot – sub-millisecond VM sandboxes using CoW memory forking

https://github.com/adammiribyan/zeroboot
5•adammiribyan•5h ago•2 comments

Show HN: M68k assembly emulator that runs in the browser

https://github.com/gianlucarea/m68k-interpreter
10•aldino97•9h ago•1 comments

Show HN: Drakkar.one – Google Maps embed replacement, no API keys, GDPR-ready

https://drakkar.one/
7•d0min0•6h ago•1 comments

Show HN: I built a React SDK to control apps with voice, gaze and gestures

https://www.youtube.com/watch?v=J7GLCerVHi4
2•andreabergonzi•2h ago•3 comments

Show HN: Hecate – Call an AI from Signal

https://github.com/rhodey/hecate
24•rhodey•1d ago•3 comments

Show HN: Hackerbrief – Top posts on Hacker News summarized daily

https://hackerbrief.vercel.app/
73•p0u4a•1d ago•46 comments

Show HN: Basalt – IDE-like documentation for infrastructure and API

https://basalt-docs.com
4•temakonkin•8h ago•0 comments

Show HN: Signet – Autonomous wildfire tracking from satellite and weather data

https://signet.watch
122•mapldx•2d ago•31 comments

Show HN: GDSL – 800 line kernel: Lisp subset in 500, C subset in 1300

https://firthemouse.github.io/
89•FirTheMouse•2d ago•20 comments

Show HN: What if your synthesizer was powered by APL (or a dumb K clone)?

https://octetta.github.io/k-synth/
92•octetta•2d ago•32 comments

Show HN: Sprinklz.io – An RSS reader with powerful algorithmic controls

https://sprinklz.io
14•sammy0910•1d ago•3 comments

Show HN: Android Native Reverse Tools

https://neocanable.github.io/2026/01/17/rosemary-development-status.html
2•neocanable•12h ago•1 comments

Show HN: GitAgent – An open standard that turns any Git repo into an AI agent

https://www.gitagent.sh/
146•sivasurend•3d ago•38 comments

Show HN: Ichinichi – One note per day, E2E encrypted, local-first

133•katspaugh•3d ago•59 comments

Show HN: Han – A Korean programming language written in Rust

https://github.com/xodn348/han
207•xodn348•2d ago•117 comments

Show HN: Goal.md, a goal-specification file for autonomous coding agents

https://github.com/jmilinovich/goal-md
29•jmilinovich•2d ago•8 comments