frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: ClawGuard – Detect 42 prompt injection patterns in <10ms

https://github.com/joergmichno/clawguard
2•joergmichno•1d ago

Comments

joergmichno•1d ago
A bit more context on why pattern matching instead of ML:

1. Speed: <10ms vs 200-500ms for LLM-based checks means you can scan every user message without adding latency.

2. Cost: No API calls to OpenAI/Anthropic for detection = predictable costs at scale.

3. Transparency: When a pattern matches, you know exactly which of the 42 patterns triggered and why. No "the model thinks this looks suspicious."

The tradeoff is obvious — patterns can't catch truly novel attacks. But neither can LLMs reliably (they get tricked by the same prompt injections they're supposed to detect).

My goal: catch the 80% of attacks that are copy-pasted from public prompt injection databases, so you can focus your resources on the remaining 20%.

For CI/CD users: the GitHub Action runs ClawGuard on every PR, so you catch injections before they reach production. The Python SDK lets you integrate scanning into your agent pipeline with two lines of code.

Would love to hear from folks running AI agents in production — what's your current detection strategy?

Someone•1d ago
I think this is an example where obscurity is required to get (some) security. Making this and its test cases public makes training a model to circumvent it too easy.
joergmichno•1d ago
Fair point — and one I thought about carefully before open-sourcing.

A few reasons why I think open patterns are actually the right call:

1. The patterns are already public. Most prompt injection techniques are documented on GitHub, in research papers, and on sites like jailbreakchat. Attackers don't need my regex list — they already have the playbook.

2. Security through obscurity doesn't work for defense. History (from antivirus to WAFs to OWASP) shows that open detection rules get more eyes, more contributions, and faster updates than closed ones. Snort, ModSecurity, YARA — all open, all industry standard.

3. The real threat isn't regex bypass. If an attacker is sophisticated enough to craft novel prompts that evade pattern matching, they'll also evade most LLM-based detectors. The answer for that 20% is layered defense (output filtering, sandboxing, least-privilege), not secret patterns.

4. Open source = trust. Enterprise customers want to audit what's running in their pipeline. "Trust us, it's secret" is a harder sell than "here are the exact 42 patterns, verify them yourself."

That said — the paid Shield API does include additional detection layers beyond the open-source patterns, specifically for this reason.

IronDiff – Network Config Backup and Analysis

1•MattRos•52s ago•0 comments

Ruby Users Forum February–March Update

https://www.rubyforum.org/t/monthly-update-february-wrap-up-march-preview/177
1•todsacerdoti•1m ago•0 comments

Amazon Wins Court Order Blocking Perplexity AI Shopping Bots

https://www.bloomberg.com/news/articles/2026-03-10/amazon-wins-court-order-blocking-perplexity-s-...
1•juokaz•3m ago•0 comments

Fooling AI Agents: Web-Based Indirect Prompt Injection Observed in the Wild

https://unit42.paloaltonetworks.com/ai-agent-prompt-injection/
2•tcbrah•3m ago•0 comments

Precision Learning Has the Potential to Do What Personalized Learning Could Not

https://www.the74million.org/article/precision-learning-has-the-potential-to-do-what-personalized...
1•jyunwai•3m ago•0 comments

Towards "Let's Encrypt" for Document Signing

https://documenso.com/blog/building-documenso-part-3---aes-qes-qtsp
1•ElTimuro•4m ago•1 comments

Amazon asks senior engineers to address issues created by 'AI assisted changes'

https://www.tomshardware.com/tech-industry/artificial-intelligence/amazon-calls-engineers-to-addr...
2•cdrnsf•4m ago•1 comments

Tesla FSD drives through railroad crossing barriers in viral video

https://electrek.co/2026/03/09/tesla-fsd-drives-through-railroad-crossing-barriers-viral-video/
1•nixass•4m ago•0 comments

Why on-device agentic AI can't keep up

https://martinalderson.com/posts/why-on-device-agentic-ai-cant-keep-up/
1•devy•4m ago•0 comments

Rust Coreutils 0.7 Released with Many Performance Optimizations

https://www.phoronix.com/news/Rust-Coreutils-uutils-0.7
1•Bender•4m ago•0 comments

$3 ChromeOS Flex stick will revive old and outdated computers

https://9to5google.com/2026/03/10/this-3-chromeos-stick-will-revive-old-and-outdated-computers/
2•pentagrama•6m ago•0 comments

Ultra-compact photonic AI chip operates at the speed of light

https://techxplore.com/news/2026-03-ultra-compact-photonic-ai-chip.html
1•Brajeshwar•6m ago•0 comments

Tiny transmitter could help scientists understand surprisingly social wasps

https://spectrum.ieee.org/rf-tags
1•Brajeshwar•6m ago•0 comments

Hiroo Onoda: The Japanese Soldier Who Continued Fighting World War II Until 1974

https://www.utterlyinteresting.com/post/hiroo-onoda-japanese-soldier-who-fought-ww2-until-1974
1•freediver•7m ago•0 comments

Experiments.md to stay sane down the rabbit hole

https://www.execfoo.de/blog/experiments.html
1•softwarehippie•7m ago•0 comments

Trump Admin Cyber Strategy Centers Private Sector in Offensive Cyber Operations

https://www.lawfaremedia.org/article/trump-admin-cyber-strategy-centers-private-sector-in-offensi...
1•hn_acker•7m ago•0 comments

How did the Apollo flight computers get men to the moon and back? (2018) [video]

https://www.youtube.com/watch?v=ULGi3UkgW30
2•spking•10m ago•0 comments

YouTube ads are about to get even longer and they'll be unskippable

https://www.dexerto.com/youtube/youtube-ads-are-about-to-get-even-longer-and-theyll-be-unskippabl...
3•robtherobber•10m ago•0 comments

Gemini Embedding 2: Our first natively multimodal embedding model

https://blog.google/innovation-and-ai/technology/developers-tools/gemini-embedding-2/
2•meetpateltech•11m ago•0 comments

Ask HN: What will be the future of RPE in IT services

1•sriramgonella•11m ago•0 comments

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025)

https://88mph.fm/
1•matteocantiello•11m ago•0 comments

An Update on SVG in GTK

https://blogs.gnome.org/gtk/2026/02/25/an-update-on-svg-in-gtk/
1•PaulHoule•12m ago•0 comments

Ad-tech is fascist tech

https://pluralistic.net/2026/03/10/ice-tech/#foreseeable-outcomes
8•only_in_america•12m ago•0 comments

I built and used this boilerplate to generate $2.5M in revenue over 5 years

https://www.indiehackers.com/post/i-built-and-used-this-boilerplate-to-generate-2-5m-in-revenue-o...
2•iwillinc•12m ago•0 comments

Show HN: React Tourlight

https://github.com/btahir/react-tourlight
1•bilater•13m ago•0 comments

Cold Outreach

https://blog.fortrabbit.com/cold-outreach/
1•esher•13m ago•0 comments

Sweat of Tourists Has Covered Michelangelo's Sistine Chapel Fresco in White Film

https://www.smithsonianmag.com/smart-news/the-sweat-of-tourists-has-covered-michelangelos-sistine...
1•bookofjoe•13m ago•0 comments

Live Nation and US Justice Department Edge Towards Settling Antitrust Lawsuit

https://thequietus.com/news/live-nation-and-us-justice-department-edge-towards-settling-antitrust...
1•cdrnsf•14m ago•0 comments

Ask HN: What are you using OpenClaw for?

1•arbuge•15m ago•0 comments

How Well Does Agent Development Reflect Real-World Work?

https://arxiv.org/abs/2603.01203
1•fauigerzigerk•15m ago•0 comments