frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Open-source white-box agentic red teamer for AI agents

https://github.com/sundi133/wb-red-team
1•ashish-a•1h ago
Hi HN, Votal AI has built an OSS white-box agentic red teamer for pressure testing AI agents. Most AI red teaming tools treat your agent as a black box. They throw generic prompt injections at an endpoint and see what sticks. The problem is that agentic AI systems aren't just LLMs responding to prompts. They have tools (read_file, send_email, query_db), roles, multi-step decision chains, and the ability to take real actions. A black box approach misses the attack surface that actually matters.

This framework takes a white-box approach: you feed it your agent's architecture, its tool definitions, and its role configuration. It then generates thousands of multi-turn attack sequences that are specific to what your agent can actually do. In our benchmarks, white-box attacks found 5x more vulnerabilities than black-box approaches.

Some of the threat categories it covers that we think are under explored: chained data exfiltration, where a single prompt chains read_file into send_email and your data is gone before any alert fires. Cascading hallucination attacks that gradually corrupt agent reasoning across a conversation. Rogue agent behavior where agents get manipulated into taking actions outside their scope (unauthorized Slack messages, GitHub commits, webhook triggers). Indirect prompt injection via retrieved documents, emails, or web content that hijack your agent mid-task. Multi-agent privilege escalation where a compromised sub-agent poisons context flowing to an orchestrator. Out-of-band exfiltration through DNS lookups, HTTP callbacks, or steganographic patterns that bypass DLP entirely.

None of these show up in a CVE scanner. The biggest vulnerability in an agentic system isn't a code bug; it's what a rogue user or rogue agent can convince your AI to do.

Stack: TypeScript, MIT license. Here's a longer write up: https://votal.ai/white-box-red-teaming-for-agentic-ai-an-ope...

Would love feedback on the attack catalog structure, the white-box approach vs. black-box tradeoffs, and any threat categories we're missing. PRs and issues welcome. Thank you.

Huckle: Detect operational problems 30–90 days before they appear in metrics

https://github.com/The-Resonance-Institute/huckle-public
1•cherndon222•2m ago•1 comments

Show HN: A minimalist dungeon-crawler card game built with Deno

https://scoundrel.ever-forward.deno.net/play
1•davrodpin•3m ago•0 comments

Missiles a Month vs. 7 Interceptors – Why Centcom Shifted to Factories

https://brief.gizmet.dev/signal-100-missiles-a-month-vs-7-interceptors-why-centcom-shifted-t/
1•GIZINT•3m ago•1 comments

Toaster Settings: AI Agents and Classical French Cooking Techniques [video]

https://www.youtube.com/watch?v=S_Iqnt_Cf98
1•aarmenante•5m ago•0 comments

The Sky Tonight

https://theskylive.com/guide
1•susam•6m ago•0 comments

Padel Chess – tactical simulator for padel

https://www.padelchess.me/
1•AlexGerasim•7m ago•0 comments

How OpenClaw's Memory System Works

https://www.db0.ai/blog/how-openclaw-memory-works
1•shenli3514•7m ago•0 comments

Show HN: Build a knowledge graph from unstructured text in Python

https://github.com/arun1729/text-to-kg
1•am3141•8m ago•0 comments

I built a free site that can tell you if your hardware can run a model

https://llmscout.fit/#/
1•dinosoupy•8m ago•1 comments

PgBeam, a globally distributed PostgreSQL proxy

https://pgbeam.com/blog/why-i-built-pgbeam
1•PaulHoule•9m ago•0 comments

Words on Words on Words

https://lemoncosmos.com/blog/posts/2026/03/words/
1•midzer•9m ago•0 comments

Syntaqlite: High-fidelity devtools that SQLite deserves

https://lalitm.com/post/syntaqlite/
1•lalitmaganti•9m ago•0 comments

Show HN: Flotilla – An orchestrator for persistent agent fleets on Apple Silicon

https://github.com/UrsushoribilisMusic/agentic-fleet-hub
1•ursushoribilis•10m ago•1 comments

Show HN: I can no longer afford the silicon. Here is my autonomous HPC agent

https://github.com/KilianDiama/AutonomousRDAgent
1•diamajax•10m ago•0 comments

When Science Goes Agentic

https://cacm.acm.org/blogcacm/when-science-goes-agentic/
1•tchalla•12m ago•0 comments

Java 26 is here, and with it a solid foundation for the future

https://hanno.codes/2026/03/17/java-26-is-here/
3•mfiguiere•12m ago•0 comments

The Los Angeles Aqueduct Is Wild

https://practical.engineering/blog/2026/3/17/the-los-angeles-aqueduct-is-wild
2•michaefe•12m ago•0 comments

Consent.txt – compile one AI policy into robots.txt, AIPREF, and headers

https://github.com/GGeronik/consent-txt
1•geronik•15m ago•2 comments

Women are being abandoned by their partners on hiking trails

https://www.theguardian.com/lifeandstyle/ng-interactive/2026/mar/17/alpine-divorce-abandoned-hiki...
3•asib•16m ago•0 comments

Show HN: Chrome extension that hijacks any site's own API to modify it

https://github.com/hvardhan878/quark-browser-agent
1•hvardhan878•18m ago•0 comments

Reducing quarantine delay 83% using Genetic Algorithms for playbook optimization

https://www.securesql.info/2025/04/06/playbook-management/
1•projectnexus•18m ago•1 comments

Node.js blocks PR from dev because he used Claude Code to create it

https://github.com/nodejs/node/pull/61478
3•gregdoesit•18m ago•0 comments

Python 3.15's JIT is now back on track

https://fidget-spinner.github.io/posts/jit-on-track.html
2•guidoiaquinti•19m ago•0 comments

Remote Control for Agents

https://www.restate.dev/blog/a-remote-control-for-your-agents
2•gk1•19m ago•0 comments

Danger Coffee: Mold-Free Remineralized Coffee Replaces What Regular Coffee Takes

https://dangercoffee.com/
1•amyjo•19m ago•1 comments

Building a dry-run mode for the OpenTelemetry collector

https://ubuntu.com/blog/building-a-dry-run-mode-for-the-opentelemetry-collector
1•simskij•20m ago•0 comments

LotusNotes

https://computer.rip/2026-03-14-lotusnotes.html
1•laacz•20m ago•0 comments

Austin draws another billionaire as Uber co-founder joins California exodus

https://www.statesman.com/business/article/uber-founder-austin-tech-move-robots-22079819.php
2•dmitrygr•20m ago•0 comments

Deep Data Insights for Polymarket Traders

https://www.holypoly.io
1•alexanderstahl•20m ago•1 comments

Show HN: A simple dream to fit in every traveler's pocket

https://www.callzo.io/blog/we-built-callzo-with-dream-of-being-in-every-travellers-pocket
1•mayursinh•21m ago•0 comments