frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Agent Audit – Open-source security scanner for AI agents

https://github.com/HeadyZhang/agent-audit
1•HaiyueZhang•1h ago

Comments

HaiyueZhang•1h ago
Hey HN,

I built Agent Audit because I kept seeing the same security anti-patterns in AI agent code — eval() on LLM outputs, subprocess.run(shell=True) with unvalidated tool inputs, system prompts constructed with f-strings. Traditional SAST tools flag some of these, but they don't understand the agent context.

What it does:

Static analysis for Python agent code (LangChain, CrewAI, AutoGen, etc.)

Scans MCP server configurations (Claude Desktop, etc.)

40+ rules mapped to the OWASP Agentic Top 10 [1]

Outputs SARIF for GitHub Code Scanning integration

Example of what it catches:

@tool def search(query: str) -> str: # AGENT-041: SQL injection via f-string return db.execute(f"SELECT * FROM docs WHERE content LIKE '%{query}%'")

Traditional scanners might flag the SQL injection. Agent Audit also flags that this is a @tool function — meaning an LLM chooses what query contains, not a user form. The attack surface is different.

What it doesn't do:

No cross-file taint analysis (yet) — currently intra-procedural only

Python only — TypeScript MCP servers need separate tooling

Static analysis limitations apply — can't catch runtime-only issues

Some numbers from scanning open-source projects:

I scanned 6 popular agent repos (OpenHands, SWE-agent, Gorilla, etc.). Found 617 findings total, 269 critical. Most common issue: tool functions that pass LLM-provided strings directly to dangerous sinks without validation.

Full report: https://github.com/HeadyZhang/agent-audit/blob/master/docs/r...

Install:

pip install agent-audit agent-audit scan .

MIT licensed. Happy to answer questions about the detection approach or agent security patterns in general.

[1] https://genai.owasp.org/resource/owasp-top-10-for-agentic-ap...

Backup Comments:

Q: "How does this compare to Semgrep?"

Semgrep is great for general SAST and I'd recommend using both. The difference:

Semgrep: "Is this a SQL injection?" (pattern-based)

Agent Audit: "Is this a tool function where an LLM controls the input, and does that input flow to a SQL query without validation?" (context-aware)

We also scan MCP configs, which is a JSON/YAML format Semgrep doesn't have rules for.

Think of it as: Semgrep for your web app, Agent Audit for your agent layer.

Q: "Aren't these just regular vulnerabilities?"

Yes and no. The vulnerabilities themselves (SQLi, command injection) are classic. What's new is:

The attack surface — You're not validating user form input; you're trusting LLM output

The trigger — Prompt injection can cause an agent to call tools maliciously

The context — Agent-specific patterns like missing max_iterations, memory poisoning, inter-agent auth

A function with eval() is always risky. But eval() inside a @tool decorator is a different threat model.

Q: "False positive rate?"

We use confidence scoring (0.0-1.0) and tier findings as BLOCK/WARN/INFO. High-confidence (BLOCK tier) findings have low FP rate in our benchmarks. Lower tiers need human review.

You can also use baseline mode to only see new findings in PRs.

Q: "Why not just sandbox everything?"

You should! But defense in depth matters. Agent Audit catches issues before runtime. Sandboxing catches issues at runtime. Both are valuable.

Also, many agent deployments don't use proper sandboxing — that's one of the things we flag (AGENT-047: unsandboxed subprocess).

Q: "What about JavaScript/TypeScript agents?"

Not yet, but it's on the roadmap. Python is where most of the agent frameworks are today (LangChain, CrewAI, AutoGen, DSPy). Happy to accept contributions for TS support.

Show HN: 33rpm – A vinyl screensaver for macOS that syncs to your music

https://33rpm.noonpacific.com/
1•kaniksu•1m ago•0 comments

Google Workers Demand End to Cloud Services for Immigration Agencies

https://www.nytimes.com/2026/02/06/business/google-employees-protest.html
1•donohoe•1m ago•0 comments

Gravity = Magnetism and Heat

https://zenodo.org/records/18498514
1•phdlalala•7m ago•0 comments

Debian project leader warns of developers' silent withdrawal

https://www.heise.de/en/news/Debian-Project-leader-warns-of-developers-silent-withdrawal-11167886...
2•layer8•7m ago•0 comments

Your Agent's Memory Is Broken. Here's Why

https://ramsriharsha.substack.com/p/your-agents-memory-is-broken-heres
1•riemannzeta•10m ago•0 comments

Show HN: I built a YC-style interview practice tool for myself, now public

https://www.getycready.com/
1•zacharykapank•12m ago•1 comments

Continual learning and the post monolith AI era

https://www.baseten.co/resources/research/continual-learning/#introduction
1•jxmorris12•15m ago•0 comments

Detecting backdoored language models at scale

https://www.microsoft.com/en-us/security/blog/2026/02/04/detecting-backdoored-language-models-at-...
1•geox•18m ago•0 comments

$530B in AI Capex looks terrible if you forget how accounting works

https://deadneurons.substack.com/p/the-cloud-hyperscalers-are-starting
1•nr378•19m ago•0 comments

Ask HN: Non AI-obsessed tech forums

4•nanocat•22m ago•2 comments

Show HN: A React testing boilerplate built for vibe coding

https://www.testsolid.com/
1•dudeskey•23m ago•0 comments

Persistent Memory for OpenClaw/Moltbot/Clawdbot

https://mem0.ai/blog/mem0-memory-for-openclaw
1•ninadwrites•24m ago•0 comments

Executive Function as Code: using (Doom) Emacs to script my brain

https://milly.kittycloud.eu/posts/executive-function-as-code-doom-emacs-adhd/
2•0xMillyByte•24m ago•0 comments

Ultima IX

https://www.filfre.net/2026/02/ultima-ix/
2•doppp•25m ago•0 comments

Will firms try to combine software developer and product manager roles?

https://bjornwestergard.com/firms-combine-software-roles/
1•bwestergard•26m ago•0 comments

Show HN: Chiptune Tracker

https://chiptunes.netlify.app
2•iamdan•27m ago•0 comments

Words That Mean Nothing

https://dogdogfish.com/blog/2026/02/06/words-that-mean-nothing/
2•matthewsharpe3•30m ago•0 comments

Show HN: Falcon's Eye (isometric NetHack) running in the browser via WebAssembly

https://rahuljaguste.github.io/Nethack_Falcons_Eye/
1•rahuljaguste•31m ago•1 comments

Claude Opus 4.6 vs. GPT-5.3-Codex: AI Model Showdown

https://badlucksbane.com/posts/claude-opus-4-6-vs-gpt-5-3-codex-the-ai-model-showdown.html
1•IAmNeo•32m ago•0 comments

Show HN: Vibe coded real-time Super Bowl Squares app (Claude Code and Opus 4.5)

https://defirate.com/squares/
1•ksaville•33m ago•0 comments

The Problem with Silicon Carbon Batteries [video]

https://www.youtube.com/watch?v=zPAY2VxfFBk
1•mgh2•34m ago•0 comments

Gizmo: A TikTok for interactive, vibe-coded mini apps

https://techcrunch.com/2026/02/04/meet-gizmo-a-tiktok-for-interactive-vibe-coded-mini-apps/
1•fcpguru•36m ago•0 comments

Both GCC and Clang generate strange/inefficient code

https://codingmarginalia.blogspot.com/2026/02/both-gcc-and-clang-generate.html
4•rsf•40m ago•2 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
9•phreda4•40m ago•0 comments

Pilot mistakenly attempted to take off from a taxiway at Brussels Airport

https://www.brusselstimes.com/1956996/pilot-accidentally-takes-off-on-the-wrong-lane-at-brussels-...
1•susam•41m ago•0 comments

One Year of Using Go

https://rugu.dev/en/blog/one-year-of-go/
1•kugurerdem•41m ago•0 comments

Show HN: Ty-lsp skill for coding agents

https://github.com/agentic-utils/claude-plugins
1•brtkwr•42m ago•0 comments

Choose to be the person you need the most

https://blog.aintapp.com/be-who-you-need-the-most/
1•i_k•44m ago•0 comments

Using a Jailbroken Gemini to Make Opus 4.6 Architect a Kinetic Kill Vehicle

https://recursion.wtf/posts/shadow_queen/
1•inanna_malick•46m ago•2 comments

Visualize MySQL query execution plans as interactive FlameGraphs

https://github.com/vgrippa/myflames
1•tanelpoder•48m ago•0 comments