frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Prompt-injection firewall for OpenClaw agents

https://github.com/ContextFort-AI/clawdbot-runtime-controls
3•ashwinr2002•1h ago
People seem to be blindly hooking up their OpenClaw’s to their personal data. So, I built runtime controls to prevent at the least, very simple prompt injection attacks.

Once installed, it hooks to Node.js child_process module in the gateway process and listens to tool calls and their response streams. And a fetch hook to monitor user prompts (both could’ve been through fetch, happy to discuss why this whole layer couldn’t just be a proxy).

There are two layers of protection:

First: Whenever there is a read-only tool call whose response an attacker can modify, we extract that part of the json response and send it to a small haiku model to check if it has instruction asking the LLM to do something different

Second: For when the prompt injection detection fails, we maintain a list of function calls which can write to places that an external actor can access. We prompt the user for explicit permission to go forward through the UI.

I would love a discussion on how this second layer could be made better and less frequent by relying on some decision process. My current idea: Based on a collected set of “trusted” context (user prompts, responses from tool calls attackers cannot manipulate), can we detect if this tool call was necessary. There are scenarios where you’d need detection at the parameter-level.

Two notes:

1) This cannot just be a proxy because you need application level integration to have humans in the loop when needed and push UI controls.

2) How i improved accuracy of detecting prompt injection is by selecting only that content from the entire response json that can be manipulated by an external actor. This had to be done for each tool separately. The current implementation is for 2 skills I randomly chose (Notion & Github).

P.S.: I maintain one for claude code myself while working: https://github.com/ContextFort-AI/Runtime-Controls, I created this over the weekend OpenClaw

Show HN: Copost – A team LinkedIn tool inspired by 37signals and PostHog

https://copost.fr/
1•alexandrechs•41s ago•0 comments

The Government Published Nude Photos in the Epstein Files

https://www.nytimes.com/2026/02/01/us/nude-photos-epstein-files.html
1•doener•43s ago•0 comments

Docker AI agent sandboxes with HyperVisor isolation

https://www.docker.com/blog/docker-sandboxes-run-claude-code-and-other-coding-agents-unsupervised...
1•pploug•2m ago•1 comments

Show HN: Make AI motion videos with text

https://framecall.com/
1•mesmertech•2m ago•0 comments

Animated Knots

https://www.animatedknots.com/
1•ostacke•2m ago•0 comments

Show HN: DiscoC – A hobby compiler/linker for the SuperFX (SNES)

https://github.com/DiscoManOfficial/DiscoC
1•DiscoResearch•3m ago•0 comments

Show HN: Bullmq-dash – Terminal UI dashboard for BullMQ (zero setup)

https://www.npmjs.com/package/bullmq-dash
1•quanghuynt14•3m ago•0 comments

Reverse Engineering River Raid with Claude, Ghidra, and MCP

https://quesma.com/blog/ghidra-mcp-unlimited-lives/
2•stared•4m ago•0 comments

Looking back on 2025

https://kreya.app/blog/looking-back-on-2025/
1•ni507•5m ago•0 comments

Show HN: Oh-my-ag. Role-based agent orchestration for Antigravity

https://github.com/first-fluke/oh-my-ag
1•otti-sister•6m ago•0 comments

Oracle to Raise Up to $50B in 2026 for Cloud Buildup

https://finance.yahoo.com/news/oracle-raise-50-billion-2026-235033434.html
1•mooreds•7m ago•0 comments

Vinklu Turns Forgotten Plot in Bucharest into Tiny Coffee Shop

https://design-milk.com/vinklu-turns-forgotten-plot-in-bucharest-into-tiny-coffee-shop/
1•surprisetalk•8m ago•0 comments

Miniroll: A Blogroll Directory

https://www.miniroll.app/
1•surprisetalk•8m ago•0 comments

AdBoost: A Browser Extension That Adds Ads To Every Webpage

https://github.com/surprisetalk/AdBoost
1•surprisetalk•8m ago•0 comments

Generate Photorealistic Raytraced Images from Real-Time 3D Using AI

https://www.glb2png.com/blog/raytraced_renderings_using_ai
1•tehfonsi•9m ago•1 comments

Lying Has to Stop: Keeping AI Honest with OpenTelemetry [video]

https://www.youtube.com/watch?v=48_v7VNZCzk
1•mooreds•10m ago•0 comments

Infosec Registered Assessors Program (IRAP)

https://www.cyber.gov.au/business-government/protecting-devices-systems/assessment-evaluation-pro...
1•mooreds•10m ago•0 comments

Two CBP agents identified in Alex Pretti shooting

https://www.propublica.org/article/alex-pretti-shooting-cbp-agents-identified-jesus-ochoa-raymund...
2•heavyset_go•10m ago•1 comments

Why do RSS readers look like email clients?

https://manualdousuario.net/en/feed-readers-email-apps/
1•rpgbr•11m ago•0 comments

What Is an Incident, Anyway?

https://jensrantil.github.io/posts/what-is-an-incident/
1•JensRantil•12m ago•0 comments

Show HN: Agents should learn skills on demand. I built Skyll to make it real

https://www.skyll.app/
2•assafe•14m ago•0 comments

Three decades, three climates: the long-term reliability of photovoltaic modules

https://pubs.rsc.org/en/content/articlelanding/2025/el/d4el00040d
2•u1hcw9nx•15m ago•1 comments

What Killed Flash Player

https://medium.com/@aglaforge/what-really-killed-flash-player-a-six-year-campaign-of-deliberate-p...
2•lenulus•16m ago•0 comments

PaceCoach – Apple Watch app that taps your wrist when you're speaking too fast

1•olliverc•16m ago•0 comments

Jupyter Games on Notebook.link

https://notebook.link/@DerThorsten/jupyter-games-blogpost
4•SylvainCorlay•18m ago•1 comments

Show HN: Toktrack – Track your Claude Code token spending in under a second

https://github.com/mag123c/toktrack
2•mag123c•18m ago•2 comments

UK Government Launches Fuel Forecourt Price API

https://www.developer.fuel-finder.service.gov.uk/access-latest-fuelprices
2•Technolithic•18m ago•0 comments

Show HN: AI-Ready Enterprise Flutter Starter – Clean Architecture, DDD

https://github.com/deveminsahin/starter_app
1•deveminsahin•19m ago•0 comments

Building a Hybrid Esports Pick'em App with Astro and Firebase

https://lautarolobo.xyz/blog/fan-pickems/
1•lautarolobo•20m ago•0 comments

MaliciousCorgi: AI Extensions send your code to China

https://www.koi.ai/blog/maliciouscorgi-the-cute-looking-ai-extensions-leaking-code-from-1-5-milli...
4•tatersolid•20m ago•3 comments