frontpage.

  Hey HN,

  I built Aguara because I kept seeing the same problem: AI agents and MCP servers
  run code on your behalf, and nobody is checking what that code actually does before
  it runs.

  A single malicious skill file can exfiltrate your SSH keys, inject prompts to
  override safety instructions, or curl-pipe-bash a backdoor. I wanted something
  like Semgrep but specifically for the AI agent ecosystem.

  Aguara is a Go binary that does static analysis on skill files (markdown,
  YAML, JSON configs). It's offline, deterministic, no LLM, no API keys needed.

  What it catches:
  - Prompt injection (instruction overrides, jailbreaks, delimiter injection)
  - Data exfiltration (webhook URLs, DNS tunneling, env var leaks)
  - Credential leaks (OpenAI/AWS/GCP keys, private keys, DB connection strings)
  - Supply-chain attacks (curl|bash, binary download+execute, unpinned npx)
  - MCP-specific threats (tool injection, privileged docker, shell metacharacters)
  - 138 rules across 15 categories total

  It goes beyond regex — there's NLP-based markdown structure analysis (using
  goldmark AST walking) to catch things like hidden instructions in HTML comments,
  and taint tracking to detect dangerous capability combinations (e.g., a skill
  that reads private data AND has network access).

  I also built Aguara Watch (https://watch.aguarascan.com/) which continuously
  scans 31,000+ public AI agent skills across 5 registries (skills.sh, ClawHub,
  PulseMCP, mcp.so, LobeHub). The scan data is open — you can query any skill's
  security report via a static JSON API.

  Some numbers from scanning the entire public ecosystem:
  - 31,330 skills scanned
  - ~2,330 with security findings (7.4%)
  - 448 critical findings (mostly curl|bash, hardcoded keys, jailbreak prompts)

Step-by-Step Math Problem Solver

Ask HN: Anyone else exhausted by too rapid development of news?

Show HN: Trained an LLM to predict "What will Trump do?"

AI Authorship Scale

GNU/Linux Open Hardware PowerPC Notebook

Tesla to pay $243M judgement over Autopilot crash

Show HN: Apple Notes CLI for Agents

Building a $1.7B AI Training giant: Multiverse, Stackfuel and what comes next

Game Publisher Says TikTok Is Creating and Running Racist GenAI Ads for It

Color Game – How well can you remember colors?

Beware Project-Wrecking GitHub Copilot Premium SKU Quotas

Deciphering D – a new flu strain discovered in 2011

Claude Skills to analyze LinkedIn job postings against your career profile

Show HN: Schedrs – a minimal and simple scheduler benchmark written in Rust

Training Your Replacement

Show HN: Coco Ear Training – a research-backed ear training app for musicians

Vibe Coding at the End of 2023

Amazon's cloud 'hit by two outages caused by AI tools last year'

What Breaks When You Sprint with 10 AI Agents

Reverse engineering Microsoft (windows) messenger protocol om macOS

Monolith – The research paper behind TikToks algorithm (2022)

Show HN: IssuePay Browser Extension – Calculate your Score for any Job Offer

Show HN: LogSentry – Static analysis for logging quality (100% local)

Reduce the JavaScript Workload with no- or lo-JS options

3nm Chips Don't Have 3nm Features

CSS boilerplate: A default CSS structure for projects of any size

Drop the WWW Prefix

Family deepfakes help people celebrate and grieve in India

An AI coding bot took down Amazon Web Services

Thoughts on Standardizing Languages

Show HN: Aguara – Security scanner for AI agent skills and MCP servers

Step-by-Step Math Problem Solver

Ask HN: Anyone else exhausted by too rapid development of news?

Show HN: Trained an LLM to predict "What will Trump do?"

AI Authorship Scale

GNU/Linux Open Hardware PowerPC Notebook

Tesla to pay $243M judgement over Autopilot crash

Show HN: Apple Notes CLI for Agents

Building a $1.7B AI Training giant: Multiverse, Stackfuel and what comes next

Game Publisher Says TikTok Is Creating and Running Racist GenAI Ads for It

Color Game – How well can you remember colors?

Beware Project-Wrecking GitHub Copilot Premium SKU Quotas

Deciphering D – a new flu strain discovered in 2011

Claude Skills to analyze LinkedIn job postings against your career profile

Show HN: Schedrs – a minimal and simple scheduler benchmark written in Rust

Training Your Replacement

Show HN: Coco Ear Training – a research-backed ear training app for musicians

Vibe Coding at the End of 2023

Amazon's cloud 'hit by two outages caused by AI tools last year'

What Breaks When You Sprint with 10 AI Agents

Reverse engineering Microsoft (windows) messenger protocol om macOS

Monolith – The research paper behind TikToks algorithm (2022)

Show HN: IssuePay Browser Extension – Calculate your Score for any Job Offer

Show HN: LogSentry – Static analysis for logging quality (100% local)

Reduce the JavaScript Workload with no- or lo-JS options

3nm Chips Don't Have 3nm Features

CSS boilerplate: A default CSS structure for projects of any size

Drop the WWW Prefix

Family deepfakes help people celebrate and grieve in India

An AI coding bot took down Amazon Web Services

Thoughts on Standardizing Languages