Not "verify then trust." Not "trust but monitor." Just - don't trust them. Assume every user is compromised, negligent, or adversarial. Build your systems accordingly. This principle gave us least privilege, network segmentation, rate limiting, audit logs, DLP. It works.
So why are we treating AI agents like trusted colleagues?
The current alignment discourse assumes we need to make agents want to behave. Instill values. Train away deception. This is the equivalent of solving security by making users trustworthy. We tried that. It doesn't work. You can't patch human nature, and you can't RLHF your way to guaranteed safety.
Here's the thing: every principle from zero-trust security maps directly to agent orchestration.
Least privilege. An agent that writes unit tests doesn't need prod database access. Scope its capabilities via RBAC - same as you'd scope a service account.
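In Kubernetes terms that's just a namespaced Role. A minimal sketch - the name `agent-test-writer`, the `agents` namespace, and the specific rules are illustrative, not anything Hortator prescribes:

```go
package main

import (
	"fmt"

	rbacv1 "k8s.io/api/rbac/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/yaml"
)

func main() {
	// A namespaced Role for a test-writing agent: it may read source ConfigMaps
	// and create Jobs to run the suite - nothing else, and nothing cluster-wide.
	role := rbacv1.Role{
		TypeMeta:   metav1.TypeMeta{APIVersion: "rbac.authorization.k8s.io/v1", Kind: "Role"},
		ObjectMeta: metav1.ObjectMeta{Name: "agent-test-writer", Namespace: "agents"},
		Rules: []rbacv1.PolicyRule{
			{APIGroups: []string{""}, Resources: []string{"configmaps"}, Verbs: []string{"get", "list"}},
			{APIGroups: []string{"batch"}, Resources: []string{"jobs"}, Verbs: []string{"create", "get", "watch"}},
		},
	}

	out, err := yaml.Marshal(role)
	if err != nil {
		panic(err)
	}
	fmt.Print(string(out))
}
```

Bind that to the agent pod's service account and the test-writing agent can't touch anything outside those two rules, no matter what it decides it wants.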
Isolation. Each agent runs in its own pod. It can't read another agent's memory, touch its files, or escalate sideways. Same reason you don't run microservices as root in a shared namespace.
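Roughly what that looks like per agent pod - a sketch, with the pod name, image, user ID, and namespace as placeholders:

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/yaml"
)

func boolPtr(b bool) *bool    { return &b }
func int64Ptr(i int64) *int64 { return &i }

func main() {
	// One pod per agent: non-root, no privilege escalation, read-only root
	// filesystem, and its own service account so RBAC stays per-agent.
	pod := corev1.Pod{
		TypeMeta:   metav1.TypeMeta{APIVersion: "v1", Kind: "Pod"},
		ObjectMeta: metav1.ObjectMeta{Name: "legionary-42", Namespace: "agents"},
		Spec: corev1.PodSpec{
			ServiceAccountName: "agent-test-writer",
			SecurityContext: &corev1.PodSecurityContext{
				RunAsNonRoot: boolPtr(true),
				RunAsUser:    int64Ptr(65534),
			},
			Containers: []corev1.Container{{
				Name:  "agent",
				Image: "example.com/agent-runtime:latest", // placeholder image
				SecurityContext: &corev1.SecurityContext{
					AllowPrivilegeEscalation: boolPtr(false),
					ReadOnlyRootFilesystem:   boolPtr(true),
				},
			}},
		},
	}

	out, err := yaml.Marshal(pod)
	if err != nil {
		panic(err)
	}
	fmt.Print(string(out))
}
```

A default-deny NetworkPolicy in the same namespace covers the lateral-movement half.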
Budget enforcement. Token caps and cost limits per agent, per task. An agent that tries to burn $10k on a $5 task gets killed. Like API rate limits, but for cognition.
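The enforcement logic isn't exotic. A toy version of the guard - not Hortator's actual implementation, just the shape of the idea - fits in a few lines:

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// ErrBudgetExceeded is returned when cumulative spend crosses the cap.
var ErrBudgetExceeded = errors.New("budget exceeded: task terminated")

// Budget tracks cumulative token and dollar spend for a single task.
type Budget struct {
	mu        sync.Mutex
	maxTokens int
	maxUSD    float64
	tokens    int
	usd       float64
}

// Charge records one model call; the caller kills the agent on error.
func (b *Budget) Charge(tokens int, usd float64) error {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.tokens += tokens
	b.usd += usd
	if b.tokens > b.maxTokens || b.usd > b.maxUSD {
		return ErrBudgetExceeded
	}
	return nil
}

func main() {
	// $5 cap on the task; an agent trying to spend past it gets stopped.
	b := &Budget{maxTokens: 200_000, maxUSD: 5.00}
	if err := b.Charge(150_000, 3.75); err != nil {
		fmt.Println(err)
	}
	if err := b.Charge(100_000, 2.50); err != nil {
		fmt.Println(err) // budget exceeded: task terminated
	}
}
```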
Audit trails. Full OpenTelemetry tracing on every action, every delegation, every result. You don't need to trust an agent if you can observe everything it does.
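With the standard Go OTel SDK, every delegation becomes a span and the child's work nests under it, so the whole chain is reconstructable later. A sketch - the tracer name and attribute keys are illustrative, and a real deployment would export to an OTLP collector instead of stdout:

```go
package main

import (
	"context"
	"log"

	"go.opentelemetry.io/otel"
	"go.opentelemetry.io/otel/attribute"
	"go.opentelemetry.io/otel/exporters/stdout/stdouttrace"
	sdktrace "go.opentelemetry.io/otel/sdk/trace"
)

func main() {
	ctx := context.Background()

	// Stdout exporter for the sketch; in a cluster this points at a collector.
	exporter, err := stdouttrace.New(stdouttrace.WithPrettyPrint())
	if err != nil {
		log.Fatal(err)
	}
	tp := sdktrace.NewTracerProvider(sdktrace.WithBatcher(exporter))
	defer func() { _ = tp.Shutdown(context.Background()) }()
	otel.SetTracerProvider(tp)

	tracer := otel.Tracer("agent-orchestrator") // tracer name is illustrative

	// One span per delegation, one per unit of work; the child nests under the
	// parent, so the tribune -> centurion -> legionary chain shows up as a tree.
	ctx, delegation := tracer.Start(ctx, "centurion.delegate")
	delegation.SetAttributes(attribute.String("agent.role", "centurion"))

	_, work := tracer.Start(ctx, "legionary.execute")
	work.SetAttributes(
		attribute.String("agent.task", "write-unit-tests"),
		attribute.Int("usage.total_tokens", 18342),
	)
	work.End()
	delegation.End()
}
```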
PII redaction. Presidio scans agent output before it leaves the pod. Same principle as DLP in enterprise - don't let sensitive data leak, regardless of intent.
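A sketch of the egress check, assuming a Presidio analyzer running as a sidecar and reached over its REST API. The URL and port are placeholders for your deployment, the response fields follow Presidio's documented analyzer output but should be treated as an assumption, and offsets are handled as byte offsets here, which only holds for ASCII text:

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"log"
	"net/http"
	"sort"
)

// finding mirrors the fields we use from a Presidio analyzer result.
type finding struct {
	EntityType string `json:"entity_type"`
	Start      int    `json:"start"`
	End        int    `json:"end"`
}

// redact sends agent output to the analyzer sidecar and masks every detected
// entity before the text is allowed to leave the pod.
func redact(analyzerURL, text string) (string, error) {
	reqBody, _ := json.Marshal(map[string]string{"text": text, "language": "en"})
	resp, err := http.Post(analyzerURL+"/analyze", "application/json", bytes.NewReader(reqBody))
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()

	var findings []finding
	if err := json.NewDecoder(resp.Body).Decode(&findings); err != nil {
		return "", err
	}

	// Replace spans back-to-front so earlier offsets stay valid.
	sort.Slice(findings, func(i, j int) bool { return findings[i].Start > findings[j].Start })
	for _, f := range findings {
		text = text[:f.Start] + "<" + f.EntityType + ">" + text[f.End:]
	}
	return text, nil
}

func main() {
	// localhost:3000 is a placeholder for wherever the analyzer sidecar listens.
	clean, err := redact("http://localhost:3000", "Ping alice@example.com about the deploy.")
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(clean) // Ping <EMAIL_ADDRESS> about the deploy.
}
```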
Policy enforcement. Declarative policies (CRDs) constrain what agents can and can't do. Like network policies, but for agent behavior.
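We won't paste the real AgentPolicy schema here, so treat this as a hypothetical, stripped-down version of the idea - a declarative allow-list the orchestrator checks before every tool call. The field names and tool names are made up for the sketch:

```go
package main

import (
	"fmt"
	"strings"
)

// AgentPolicySpec is a hypothetical, simplified policy shape: which tools an
// agent may call and which hosts it may reach.
type AgentPolicySpec struct {
	AllowedTools []string
	AllowedHosts []string
}

// Permits is the admission check run before executing a tool call.
func (p AgentPolicySpec) Permits(tool, targetHost string) error {
	if !contains(p.AllowedTools, tool) {
		return fmt.Errorf("policy violation: tool %q not allowed", tool)
	}
	for _, h := range p.AllowedHosts {
		if targetHost == h || strings.HasSuffix(targetHost, "."+h) {
			return nil
		}
	}
	return fmt.Errorf("policy violation: host %q not allowed", targetHost)
}

func contains(xs []string, x string) bool {
	for _, v := range xs {
		if v == x {
			return true
		}
	}
	return false
}

func main() {
	policy := AgentPolicySpec{
		AllowedTools: []string{"git.clone", "go.test"},
		AllowedHosts: []string{"github.com"},
	}
	fmt.Println(policy.Permits("go.test", "github.com"))    // <nil>
	fmt.Println(policy.Permits("shell.exec", "github.com")) // policy violation: tool "shell.exec" not allowed
}
```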
We built this. It's called Hortator - a Kubernetes operator for orchestrating autonomous AI agent hierarchies. Agents (tribune → centurion → legionary) run in isolated pods with RBAC, budget caps, PII redaction, and full OTel tracing. Everything is a CRD: AgentTask, AgentRole, AgentPolicy. Written in Go, MIT licensed.
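To make "everything is a CRD" concrete, here's roughly what the task type looks like as kubebuilder-style Go API types. The field names are illustrative, not the published schema - check the repo for the real thing:

```go
package v1alpha1

import metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"

// AgentTaskSpec sketches what a task CR might carry: the work item, which role
// and policy govern it, and the hard budget it may not exceed.
type AgentTaskSpec struct {
	// Prompt is the work item handed to the agent.
	Prompt string `json:"prompt"`
	// RoleRef names the AgentRole (tribune, centurion, legionary) that runs the task.
	RoleRef string `json:"roleRef"`
	// PolicyRef names the AgentPolicy constraining tool and network access.
	PolicyRef string `json:"policyRef"`
	// Budget is the hard cap; exceeding it terminates the task's pod.
	Budget BudgetSpec `json:"budget"`
}

// BudgetSpec is the per-task spending limit.
type BudgetSpec struct {
	MaxTokens int    `json:"maxTokens"`
	MaxUSD    string `json:"maxUSD"` // decimal string, e.g. "5.00"
}

// AgentTaskStatus records what actually happened, including the trace that proves it.
type AgentTaskStatus struct {
	Phase      string `json:"phase,omitempty"` // Pending, Running, Succeeded, Failed, Killed
	TokensUsed int    `json:"tokensUsed,omitempty"`
	TraceID    string `json:"traceID,omitempty"` // links the task to its OTel trace
}

// AgentTask is the CR the operator reconciles into an isolated agent pod.
type AgentTask struct {
	metav1.TypeMeta   `json:",inline"`
	metav1.ObjectMeta `json:"metadata,omitempty"`
	Spec              AgentTaskSpec   `json:"spec,omitempty"`
	Status            AgentTaskStatus `json:"status,omitempty"`
}
```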
We didn't solve alignment. We made it irrelevant by treating agents as untrusted workloads - exactly how we've treated every other piece of software for the last 20 years.
GitHub: https://github.com/hortator-ai/Hortator/
Genuinely curious what this community thinks. Are we wrong to frame alignment as an infrastructure problem? What's the zero-trust model missing when applied to agents? Poke holes - that's what we need.