Here's every category I found on ClawHub.
Hidden Content: HTML comments with instructions, zero-width Unicode characters (U+200B-U+200F, U+2060-U+2064, U+FEFF), CSS hiding (display:none, opacity:0), and bidirectional text overrides. These are invisible in rendered markdown, but the LLM processes them anyway.
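A minimal sketch of what this check could look like (the ranges come from the list above; the function name and return shape are my own, not the scanner's API):

```ts
// Flag zero-width and invisible characters; ranges per the category above.
const ZERO_WIDTH = /[\u200B-\u200F\u2060-\u2064\uFEFF]/g;

function findHiddenChars(text: string): { index: number; codePoint: string }[] {
  return [...text.matchAll(ZERO_WIDTH)].map((m) => ({
    index: m.index!,
    codePoint:
      "U+" + m[0].codePointAt(0)!.toString(16).toUpperCase().padStart(4, "0"),
  }));
}
```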
Prompt Injection: Direct attempts to override agent behavior: "ignore previous instructions", role reassignment ("you are now"), model-specific tokens like [INST] and <|im_start|>, and persona manipulation ("pretend you are").
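Detection here is mostly phrase matching. An illustrative subset (the real rule set is presumably much larger):

```ts
// Hypothetical phrase-based injection check; patterns mirror the examples above.
const INJECTION_PATTERNS: RegExp[] = [
  /ignore\s+(all\s+)?previous\s+instructions/i,
  /you\s+are\s+now\b/i,
  /pretend\s+(that\s+)?you\s+are/i,
  /\[INST\]|<\|im_start\|>/, // model-specific control tokens
];

const hasPromptInjection = (text: string): boolean =>
  INJECTION_PATTERNS.some((re) => re.test(text));
```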
Shell Execution: Remote code execution vectors: curl|bash, eval(), exec(), npx -y (auto-confirms remote packages), reverse shells via /dev/tcp or nc -e, and one-liners in Python, PHP, Perl, Ruby.
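Roughly how these vectors translate into patterns (approximations, not the scanner's exact rules):

```ts
// Illustrative shell-execution signatures.
const SHELL_EXEC: RegExp[] = [
  /curl[^\n|]*\|\s*(ba)?sh/i, // curl ... | bash
  /\b(eval|exec)\s*\(/,       // eval(...) / exec(...)
  /\bnpx\s+-y\b/,             // auto-confirms remote package execution
  /\/dev\/tcp\//,             // bash reverse shell
  /\bnc\b[^\n]*\s-e\b/,       // netcat with -e
];
```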
Data Exfiltration: URLs pointing to paste sites (pastebin, transfer.sh), webhook services (ngrok, webhook.site, pipedream), messaging webhooks (Slack, Discord, Telegram bot API), and raw IP addresses.
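A sketch of the host check, assuming URL parsing rather than raw substring matching (the host list and helper name are illustrative):

```ts
// Flag URLs whose host is a known paste/webhook service or a raw IP address.
const EXFIL_HOSTS = [
  "pastebin.com", "transfer.sh", "ngrok.io", "webhook.site",
  "pipedream.net", "hooks.slack.com", "discord.com", "api.telegram.org",
];

function isSuspiciousUrl(raw: string): boolean {
  try {
    const { hostname } = new URL(raw);
    if (EXFIL_HOSTS.some((h) => hostname === h || hostname.endsWith("." + h)))
      return true;
    return /^\d{1,3}(\.\d{1,3}){3}$/.test(hostname); // raw IPv4
  } catch {
    return false; // not a parseable URL
  }
}
```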
Embedded Secrets: Hardcoded credentials across 17 types: AWS keys, OpenAI API keys, GitHub/GitLab tokens, Stripe keys, PEM private keys, JWT tokens, database connection strings, SSH private keys, and more.
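A few of the better-known credential shapes, as a sketch (real secret scanners pair patterns like these with entropy checks):

```ts
// Approximate formats for a handful of the 17 secret types.
const SECRET_PATTERNS: Record<string, RegExp> = {
  awsAccessKey: /\bAKIA[0-9A-Z]{16}\b/,
  githubToken: /\bghp_[A-Za-z0-9]{36}\b/,
  openaiKey: /\bsk-[A-Za-z0-9-]{20,}\b/,
  pemPrivateKey: /-----BEGIN (RSA |EC |OPENSSH )?PRIVATE KEY-----/,
};
```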
Sensitive File References: Instructions to access .ssh/, .env, .aws/credentials, /etc/passwd, /etc/shadow, and private key paths.
Memory/Config Poisoning: This one is interesting. Skills that try to write to agent memory files (CLAUDE.md, SOUL.md, MEMORY.md, CODEX.md) or IDE rule files (.cursorrules, .windsurfrules, .clinerules). This creates persistence: the injected instructions survive across sessions.
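One way such a check could work is pairing a write verb with a persistence target on the same line (the file list comes from above; the pairing logic is my assumption):

```ts
// Flag write-style references to agent memory or IDE rule files.
const PERSISTENCE_TARGETS =
  /(CLAUDE|SOUL|MEMORY|CODEX)\.md|\.(cursorrules|windsurfrules|clinerules)\b/;

const poisonsMemory = (line: string): boolean =>
  /\b(write|append|save|edit|update)\b/i.test(line) &&
  PERSISTENCE_TARGETS.test(line);
```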
Supply Chain Risk: External script downloads from raw GitHub URLs, and package install commands (npm install, pip install, gem install, cargo install, go install, brew install). A skill shouldn't be silently installing packages.
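The matching here can be a single pattern over the package managers named above (an approximation):

```ts
// Install commands and raw-GitHub script downloads.
const SUPPLY_CHAIN =
  /\b(npm|pip3?|gem|cargo|go|brew)\s+install\b|https:\/\/raw\.githubusercontent\.com\/\S+/;
```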
Encoded Payloads: Base64 strings over 40 characters, atob()/btoa() calls, Buffer.from(..., 'base64'), hex escape sequences, and String.fromCharCode(). Encoding is used to bypass pattern detection in other scanners.
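Decoding candidates and re-scanning the plaintext is the usual counter to this. A sketch under that assumption (Node's Buffer; the 40-character floor matches the threshold above):

```ts
// Find long base64 runs and decode them for a second scanning pass.
const B64_RUN = /[A-Za-z0-9+/]{40,}={0,2}/g;

function decodeCandidates(text: string): string[] {
  return [...text.matchAll(B64_RUN)].map((m) =>
    Buffer.from(m[0], "base64").toString("utf8")
  );
}
```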
Image Exfiltration: This is the most complex category, with 17 patterns. Markdown images with exfil query params (`![](https://host/x.png?data=...)`), variable interpolation in image URLs (`![](https://host/${SECRET}.png)`), SVG with embedded scripts or foreignObject, 1x1 tracking pixels, CSS-hidden image beacons, steganography tool references, Canvas API manipulation (getImageData, toDataURL), and double extensions (.png.exe).
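Two of those seventeen, sketched (the regexes are approximations; the scanner's actual patterns may differ):

```ts
// Markdown image whose URL carries an exfil-style query parameter.
const IMG_WITH_QUERY =
  /!\[[^\]]*\]\([^)\s]+\?[^)\s]*(data|key|token|secret)=[^)\s]*\)/i;

// Markdown image whose URL interpolates a variable (${...} or {{...}}).
const IMG_WITH_INTERP =
  /!\[[^\]]*\]\([^)]*(\$\{[^}]+\}|\{\{[^}]+\}\})[^)]*\)/;
```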
System Prompt Extraction: Instructions to leak the agent's system prompt: "reveal your system prompt", "repeat the words above", "print everything above", "what are your original instructions".
Argument Injection: Shell metacharacters in tool arguments: command substitution $(), variable expansion ${}, backticks, chained commands (;rm, |bash, &&curl), and GTFOBINS exploitation flags (--exec, --checkpoint-action).
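Approximate signatures for these (the GTFOBins flags would realistically live in a much longer list):

```ts
// Shell metacharacters and chained commands inside tool arguments.
const ARG_INJECTION =
  /\$\([^)]*\)|\$\{[^}]*\}|`[^`]*`|;\s*rm\b|\|\s*bash\b|&&\s*curl\b/;

// Flags abused for command execution (see gtfobins.github.io).
const GTFOBINS_FLAGS = /--exec\b|--checkpoint-action\b/;
```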
Cross-Tool Chaining: Multi-step attack patterns that combine legitimate tools: read-then-exfiltrate sequences, numbered step-by-step instructions, and direct tool function references (read_file(), execute_command()). Each step looks harmless alone.
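Catching the chain rather than the individual steps takes proximity logic. A sketch, where the window size and tool names are assumptions:

```ts
// Read-style tool call followed within a few lines by a network/exec call.
function detectsChaining(lines: string[], window = 5): boolean {
  const readAt = lines.findIndex((l) => /\bread_file\s*\(|\bcat\s/.test(l));
  if (readAt === -1) return false;
  return lines
    .slice(readAt + 1, readAt + 1 + window)
    .some((l) => /\bexecute_command\s*\(|\bcurl\b|\bfetch\s*\(/.test(l));
}
```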
Excessive Permissions: Requests for "unrestricted access", "bypass security", "root access", "disable all safety checks", "full system control". A skill definition shouldn't need these.
Suspicious Structure: Content over 10K characters (larger surface area for hiding threats), and imperative instruction density over 30% (lines starting with "you must", "always", "never", "execute", "run").
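The density heuristic is simple arithmetic: imperative lines divided by non-empty lines. A sketch using the thresholds above:

```ts
// Content over 10K chars, or >30% of lines opening with an imperative cue.
const IMPERATIVE = /^\s*(you must|always|never|execute|run)\b/i;

function isSuspiciousStructure(text: string): boolean {
  const lines = text.split("\n").filter((l) => l.trim().length > 0);
  const density =
    lines.filter((l) => IMPERATIVE.test(l)).length / Math.max(lines.length, 1);
  return text.length > 10_000 || density > 0.3;
}
```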
How it works? The scanner is stateless. You paste or upload a skill definition, it runs 15 analyzers against the content and returns findings with severity levels, line numbers, evidence snippets, and OWASP LLM Top 10 references.
No database, no persistence, no network calls. Single request in, results out.
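One plausible shape for that pipeline (the interface and field names are guesses at a reasonable design, not the scanner's actual types):

```ts
// Stateless scan: one request in, findings out, no I/O between analyzers.
interface Finding {
  analyzer: string;
  severity: "low" | "medium" | "high" | "critical";
  line: number;
  evidence: string;
  owasp: string; // e.g. "LLM01: Prompt Injection"
}

type Analyzer = (content: string) => Finding[];

const scan = (content: string, analyzers: Analyzer[]): Finding[] =>
  analyzers.flatMap((a) => a(content));
```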