frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: AgentArmor – open-source 8-layer security framework for AI agents

https://github.com/Agastya910/agentarmor
2•AgastyaTodi•2h ago
I've been talking to founders building AI agents across fintech, devtools, and productivity – and almost none of them have any real security layer. Their agents read emails, call APIs, execute code, and write to databases with essentially no guardrails beyond "we trust the LLM."

So I built AgentArmor: an open-source framework that wraps any agentic architecture with 8 independent security layers, each targeting a distinct attack surface in the agent's data flow.

The 8 layers: L1 – Ingestion: prompt injection + jailbreak detection (20+ patterns, DAN, extraction attempts, Unicode steganography) L2 – Storage: AES-256-GCM encryption at rest + BLAKE3 integrity for vector DBs L3 – Context: instruction-data separation (like parameterized SQL, but for LLM context), canary tokens, prompt hardening L4 – Planning: action risk scoring (READ=1 → DELETE=7 → EXECUTE=8 → ADMIN=10), chain depth limits, bulk operation detection L5 – Execution: network egress control, per-action rate limiting, human approval gates with conditional rules L6 – Output: PII redaction via Microsoft Presidio + regex fallback L7 – Inter-agent: HMAC-SHA256 mutual auth, trust scoring, delegation depth limits, timestamp-bound replay prevention L8 – Identity: agent-native identity, JIT permissions, short-lived credentials

I tested it against all 10 OWASP ASI (Agentic Security Integrity) risks from the December 2025 spec. The red team suite is included in the repo.

Works as: (a) a Python library you wrap around tool calls, (b) a FastAPI proxy server for framework-agnostic deployment, or (c) a CLI for scanning prompts in CI.

Integrations included for: LangChain, OpenAI Agents SDK, MCP servers.

I ran it live with a local Ollama agent (qwen2:7b) – you can watch it block a `database.delete` at L8 (permission check), redact PII from file content at L6, and kill a prompt injection at L1 before it ever reaches the model.

GitHub: https://github.com/Agastya910/agentarmor PyPI: pip install agentarmor-core

Would love feedback, especially from people who have actually built production agents and hit security issues I haven't thought of.

TAGS: security, python, llm, ai, agents

Comments

Gnobu•1h ago
Really thorough coverage of the attack surfaces—especially including identity as a core layer. Curious how you handle cross-agent permissions in dynamic workflows: do you rely solely on deterministic checks at each action, or is there a runtime trust evaluation that can adapt as agents interact?
ibrahim_h•5m ago
The pipeline ordering is smart — L8 identity running before anything touches the ingestion layer means a rogue agent gets rejected before it even gets to inject anything. I've seen a couple agent wrappers that run input scanning first and only check identity after, which is just asking for trouble.

One thing I noticed digging through the code though — L4 risk scoring categorizes actions purely by verb. _categorize_action parses the action string for keywords like "read" or "delete" but never looks at params. So read.file targeting /etc/shadow gets a risk score of 1, while delete.file on /tmp/cache.json scores 7. In real agent workloads the target matters as much as the verb — feels like the policy engine could bridge this gap with param-aware rules, since the condition evaluator already supports params.* field resolution.

Also noticed TrustScorer takes a decay_rate in __init__ but never actually applies time-based decay anywhere — trust only changes on interactions. So an agent that was trusted six months ago and went dormant still walks back in with the same score. Small thing but could matter in long-running multi-agent setups.

The MCP rug-pull detection is the standout feature for me. Cross-referencing tool names against their descriptions to catch things like "safe_search" that actually calls exec — haven't seen that anywhere else. With how fast MCP is getting adopted this could get real traction.

Show HN: Channel Surfer – Watch YouTube like it’s cable TV

https://channelsurfer.tv
528•kilroy123•2d ago•156 comments

Show HN: Decision Guardian now comes with CLI

2•iamalizaidi•26m ago•0 comments

Show HN: Context Gateway – Compress agent context before it hits the LLM

https://github.com/Compresr-ai/Context-Gateway
84•ivzak•18h ago•49 comments

Show HN: AgentArmor – open-source 8-layer security framework for AI agents

https://github.com/Agastya910/agentarmor
2•AgastyaTodi•2h ago•2 comments

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025)

https://88mph.fm/
104•matteocantiello•3d ago•46 comments

Show HN: SupplementDEX – The Evidence-Based Supplement Database

https://supplementdex.com/
9•richarlidad•11h ago•0 comments

Show HN: Axe – A 12MB binary that replaces your AI framework

https://github.com/jrswab/axe
216•jrswab•1d ago•121 comments

Show HN: I wrote my first neural network

https://github.com/stupid-genius/Perceptron
6•allenng•11h ago•0 comments

Show HN: Svglib a SVG parser and renderer for Windows

https://github.com/bibhas2/svglib
13•leopoldj•3d ago•1 comments

Show HN: Chat Daddy – all your LLM chats in a super light terminal

https://lucianlabs.ca/blog/chat-daddy.html
2•elijahlucian•7h ago•0 comments

Show HN: OneCLI – Vault for AI Agents in Rust

https://github.com/onecli/onecli
156•guyb3•1d ago•49 comments

Show HN: Hardened OpenClaw on AWS with Terraform

https://github.com/infrahouse/terraform-aws-openclaw
7•aleks2•14h ago•1 comments

Show HN: Rudel – Claude Code Session Analytics

https://github.com/obsessiondb/rudel
141•keks0r•1d ago•83 comments

Show HN: Understudy – Teach a desktop agent by demonstrating a task once

https://github.com/understudy-ai/understudy
114•bayes-song•1d ago•41 comments

Show HN: Vibe-budget – CLI to estimate LLM costs before you start vibe coding

https://www.npmjs.com/package/vibe-budget
2•gabriel_quec016•9h ago•0 comments

Show HN: s@: decentralized social networking over static sites

http://satproto.org/
410•remywang•2d ago•219 comments

Show HN: Kube-pilot – AI engineer that lives in your Kubernetes cluster

https://github.com/fbongiovanni29/kube-pilot
2•noobernetes•9h ago•0 comments

Show HN: AgentLog – a lightweight event bus for AI agents using JSONL logs

https://github.com/sumant1122/agentlog
6•paperplaneflyr•17h ago•0 comments

Show HN: Anthrology – Time-Traveling Radio

https://anthrology.site/
7•airstrike•15h ago•5 comments

Show HN: Mutate – free inline text replacement for Mac

https://github.com/robert-v/Mutate-public
3•rob3rth•16h ago•1 comments

Show HN: Open-source browser for AI agents

https://github.com/theredsix/agent-browser-protocol
154•theredsix•2d ago•53 comments

Show HN: Simple plugin to get Claude Code to listen to you

https://www.gopeek.ai
14•itsankur•12h ago•4 comments

Show HN: DJX – Convention over Configuration for Django (Rails-Inspired CLI)

5•RedsonNgwira•16h ago•2 comments

Show HN: Fatal Core Dump – a debugging murder mystery played with GDB

https://www.robopenguins.com/fatal_core_dump/
3•axlan•11h ago•0 comments

Show HN: Vanilla JavaScript refinery simulator built to explain job to my kids

https://fuelingcuriosity.com/game.html
125•fuelingcurious•2d ago•48 comments

Show HN: I built a tool that watches webpages and exposes changes as RSS

https://sitespy.app
318•vkuprin•2d ago•77 comments

Show HN: EdgeWhisper – On-device voice-to-text for macOS (Voxtral 4B via MLX)

https://edgewhisper.com
2•raphaelmansuy•13h ago•1 comments

Show HN: Web-based ANSI art viewer

https://sure.is/ansi/
28•lubujackson•4d ago•7 comments

Show HN: Sapphire – A portable language with native UI and 3D vectors

https://github.com/foxzyt/Sapphire
3•foxz•13h ago•0 comments

Show HN: OpenClaw-class agents on ESP32 (and the IDE that makes it possible)

https://pycoclaw.com/
28•pycoclaw•1d ago•4 comments