Six weeks ago I got curious about what's actually inside the AI agent "skills" people install from ClawHub: not the descriptions, but the source code.
So I built a scanner.
It pulls skill source from GitHub, runs a set of static analysis checks (shell execution patterns, environment variable access, hardcoded credentials, SSRF patterns, eval usage, basic obfuscation detection, etc.), and then runs a second pass using an LLM to classify whether each flagged pattern looks benign in context or potentially risky.
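To make the static pass concrete, here's a minimal sketch of the kind of rule engine described above. The rule names and regexes are illustrative assumptions, not the actual ruleset; a real scanner would use AST-based checks rather than line regexes to cut down on false positives.

```python
import re

# Hypothetical rules illustrating the categories mentioned above:
# shell execution, environment variable reads, eval usage, hardcoded secrets.
RULES = {
    "shell_exec": re.compile(r"\b(os\.system|subprocess\.(run|Popen|call))\s*\("),
    "env_read": re.compile(r"\bos\.environ\b|\bos\.getenv\s*\("),
    "eval_usage": re.compile(r"\b(eval|exec)\s*\("),
    "hardcoded_secret": re.compile(
        r"(?i)(api[_-]?key|secret|token)\s*=\s*['\"][A-Za-z0-9_\-]{16,}['\"]"
    ),
}

def scan(source: str) -> list[dict]:
    """Return one finding per rule match, tagged with line number and snippet.

    Findings are raw static hits; a second pass (e.g. an LLM classifier)
    would decide whether each one is expected behavior for the skill
    or looks risky.
    """
    findings = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        for rule, pattern in RULES.items():
            if pattern.search(line):
                findings.append(
                    {"rule": rule, "line": lineno, "snippet": line.strip()}
                )
    return findings
```

A skill that shells out will trigger `shell_exec` even when the call is legitimate, which is exactly why the raw hit counts above need the contextual second pass before they mean anything.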
So far I’ve scanned 277 public skills.
Some aggregate observations:
70% triggered at least one static rule
9,710 total findings across all scans
Common patterns included unsanitized shell execution and unrestricted environment variable reads
Important caveats:
Many findings are low severity.
Static analysis is noisy.
“70%” means at least one rule triggered — not that 70% are malicious.
No dynamic/runtime execution — this is source-based analysis only.
Binary-only skills get a conservatively capped score, since there's no source to analyze and visibility is limited.
The tool is live at clawdefend.com — you can paste any ClawHub or GitHub skill URL and get a report in ~30 seconds. No login required.
There’s also a simple API if you want to integrate scans into CI or publishing workflows.
Curious how others are thinking about security models for agent marketplaces. Is static + contextual classification reasonable here, or is there a better approach?
Solo project. Happy to go deeper on methodology.
openclawed•1h ago
This is interesting. I'm going to scan some of the skills I have installed and see if it finds any issues. We need reliable scanners for these skills.
pakmania•1h ago
Thanks, let me know what you think about the results and if you run into any issues. There's also a Contact & Support link at the bottom of the page.