frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: A 3-line wrapper that enforces deterministic security for AI agents

1•tonyww•1h ago
If you are building AI agents with frameworks like browser-use, LangChain, or OpenClaw, you've likely hit the "blast radius" problem.

A misconfigured prompt or hallucination can cause an agent to navigate to a phishing domain, expose an API key, or confidently claim a task succeeded when it actually clicked a disabled button.

The standard fix right now is "LLM-as-a-judge"—taking a screenshot after the fact and asking GPT-4, "Did this work and is it safe?" That introduces massive latency, burns tokens, and is fundamentally probabilistic.

We built predicate-secure to fix this.

It’s a drop-in Python wrapper that adds a deterministic physics engine to your agent's execution loop.

In 3 to 5 lines of code, without rewriting your agent, it enforces a complete three-phase loop:

Pre-execution authorization:

Before the agent's action hits the OS or browser, it is intercepted and evaluated against a local, fail-closed YAML policy. (e.g., Allow browser.click on button#checkout, Deny fs.read on ~/.ssh/*).

Action execution:

The agent executes the raw Playwright/framework action.

Post-execution verification:

It mathematically diffs the "Before" and "After" states (DOM or system) to prove the action succeeded.

To avoid the "LLM-as-a-judge" trap, the execution of the verification is purely mathematical. We use a local, offline LLM (Qwen 2.5 7B Instruct) strictly to generate the verification predicates based on the state changes (e.g., asserting url_contains('example.com') or element_exists('#success')), and then the runtime evaluates those predicates deterministically in milliseconds.

The DX looks like this:

from predicate_secure import SecureAgent from browser_use import Agent

1. Your existing unverified agent

agent = Agent(task="Buy headphones on Amazon", llm=my_model)

2. Drop-in the Predicate wrapper

secure_agent = SecureAgent( agent=agent, policy="policies/shopping.yaml", mode="strict" )

3. Runs with full Pre- & Post-Execution Verification

secure_agent.run()

We have out-of-the-box adapters for browser-use, LangChain, PydanticAI, OpenClaw, and raw Playwright.

Because we know developers hate giving external SaaS tools access to their agent's context, the entire demo and verification loop runs 100% offline on your local machine (tested on Apple Silicon MPS and CUDA).

For enterprise/production fleets, the pre-execution gate can optionally be offloaded to our open-source Rust sidecar (predicate-authorityd) for <1ms policy evaluations.

The repo is open-source (MIT/Apache 2.0). We put together a complete, offline demo showing the wrapper blocking unauthorized navigation and verifying clicks locally using the Qwen 7B model.

Repo and Demo: https://github.com/PredicateSystems/predicate-secure

Another demo for securing your OpenClaw:

https://github.com/PredicateSystems/predicate-claw

Demo (GIF):

https://github.com/PredicateSystems/predicate-claw/blob/main...

I'd love to hear what the community thinks about deterministic verification vs. probabilistic LLM judges, or answer any questions about the architecture!

Comments

selfradiance•1h ago
Interesting approach. The deterministic verification vs. LLM-as-judge choice is the right call — probabilistic safety checks on safety-critical actions is a category error. One thing I've been thinking about: policy-based pre-execution authorization handles the prevention side well, but there's a complementary problem — what happens when an agent operates across trust boundaries where you can't predefine every allowed action? I've been exploring an economic accountability model (bond-and-slash, inspired by crypto staking) where agents post collateral that gets slashed on verified misbehavior. Prevention + accountability as two layers rather than one. Repo if anyone's curious about the other side of this: https://github.com/selfradiance/agentgate

How Predictable Are the Oscars?

https://futuresearch.ai/oscars/
2•nbosse•2m ago•1 comments

Revealed: Face of 75,000-year-old female Neanderthal from cave

https://www.cam.ac.uk/stories/shanidar-z-face-revealed
2•thunderbong•3m ago•0 comments

AI agent 'lobster fever' grips China despite risks

https://techxplore.com/news/2026-03-ai-agent-lobster-fever-china.html
1•Brajeshwar•3m ago•0 comments

LDP: Identity-Aware Routing for Multi-Agent LLMs – 37% Less Tokens

https://arxiv.org/abs/2603.08852
1•prakashsunil•4m ago•0 comments

When code is free, research is all that matters

https://twitter.com/amytam01/status/2031072399731675269
1•gmays•4m ago•0 comments

Lessons from scaling ClickHouse to petabytes of AI observability data

https://langfuse.com/blog/2026-03-10-simplify-langfuse-for-scale
2•marcklingen•6m ago•0 comments

Self-Driving Corporations (2020)

https://interconnected.org/home/2020/11/17/self_driving_corporations
1•alcazar•6m ago•0 comments

The Colorado River Does Not Reach 2030

https://drlennecefer.substack.com/p/the-colorado-river-does-not-reach
1•ThemalSpan•6m ago•0 comments

I built a GDPR analytics alternative to Google Analytics

https://eurometrics.eu
1•snesmachny•7m ago•0 comments

Lost in Backpropagation: The LM Head Is a Gradient Bottleneck

https://arxiv.org/abs/2603.10145
1•famouswaffles•7m ago•0 comments

The web in 1000 lines of C

https://maurycyz.com/projects/tinyweb/
1•maurycyz•8m ago•0 comments

Treat Interfaces as Organizational Treaties

1•davidvartanian•9m ago•0 comments

Open source UnigetUI bought by Devolutions Inc

https://github.com/Devolutions/UniGetUI/discussions/4444
1•erremerre•9m ago•0 comments

Ask HN: Best Practices for Agent Airgapping?

1•bmau5•12m ago•0 comments

Ask HN: App for clean movie/TV shorts?

2•dev181•12m ago•0 comments

How an Electrician from Kentucky Built an AI Startup with Claude

https://twitter.com/vivilinsv/status/2031826324667711774
1•pretext•12m ago•0 comments

Who's a Better Writer: A.I. Or Humans? Take Our Quiz

https://www.nytimes.com/interactive/2026/03/09/business/ai-writing-quiz.html
1•A_Duck•14m ago•1 comments

Show HN: MaximusLLM, Breaking transformer's O(N^2) and O(V) scaling bottlenecks

https://github.com/yousef-rafat/MaximusLLM/tree/main
1•yousef_g•15m ago•0 comments

Show HN: We built a billion row spreadsheet

https://rowzero.com
1•breckognize•15m ago•0 comments

Indiehacking: Lessons from 9K USD in Facebook Ad Spend [video]

https://www.youtube.com/watch?v=xoe_l67ZReU
2•mesmertech•16m ago•1 comments

Show HN: Thermal Receipt Printers – Markdown and Web UI

https://github.com/sadreck/ThermalMarky
1•howlett•16m ago•0 comments

Harness Engineering for Coding Agents

https://www.humanlayer.dev/blog/skill-issue-harness-engineering-for-coding-agents
2•0xblacklight•17m ago•0 comments

Amazon AI Outage Financial Times Correction

https://www.aboutamazon.com/news/company-news/amazon-outage-ai-financial-times-correction
2•super_linear•17m ago•0 comments

Bike and walking trails lose millions under Trump

https://rhodeislandcurrent.com/2026/03/13/repub/bike-and-walking-trails-lose-hundreds-of-millions...
3•chmaynard•19m ago•0 comments

Xnxx

1•zhhshs•20m ago•0 comments

Show HN: Re-imagine photo albums with NanoBanana

https://github.com/hbmartin/imagemine
1•hmartin•20m ago•0 comments

Darn Grid Shader (Yet). For as long as I've been writing

https://bgolus.medium.com/the-best-darn-grid-shader-yet-727f9278b9d8
1•signa11•21m ago•0 comments

Designing AI agents to resist prompt injection

https://openai.com/index/designing-agents-to-resist-prompt-injection/
2•gmays•21m ago•0 comments

How to Shut Down Meta

https://blog.zgp.org/how-to-shut-down-meta/
3•speckx•21m ago•0 comments

Mozilla Data Collective

https://datacollective.mozillafoundation.org/datasets
1•abetusk•23m ago•0 comments