Show HN: Orkia – a Rust runtime where AI agents can't bypass governance

https://github.com/orkiaHQ/orkia

1•killix•1h ago

Orkia is an open-source Rust runtime for LLM agents where policy enforcement, trust scoring, and audit trails are wired into the execution loop at the type-system level.

No code path exists that executes a tool without passing through governance. Fail-closed by default, signed session evidence (ECDSA P-256), and agents that earn autonomy through demonstrated behavior.

Apache 2.0.

Comments

killix•1h ago

Hey HN, author here. Some context on why I built this and what's interesting technically.

I was deploying LLM agents for business processes and kept hitting the same problem: every agent framework defaults to "allow everything." No policy configured? All tools available. No audit? Hope your logs are enough. No trust model? Same permissions on day one as day one thousand.

Orkia flips every default.

Fail-closed by default. No policy rule matching a tool call = denied. Not "allowed until someone writes a deny rule." This is the opposite of how most frameworks work, and it's the single decision that shapes everything else.

Trust earned, not granted. Agents start restricted and gain autonomy through behavior. ATLAS tracks 4 dimensions (task completion, policy compliance, resource usage, audit completeness) and computes an autonomy level. The key insight: trust scores are keyed on SHA-256 of the canonical agent config. Change the model, tools, or instructions, trust resets to zero. No stale trust carries over.

Signed evidence, not logs. Every session produces a SEAL artifact, an ECDSA P-256 signature binding the runtime binary hash + config fingerprint + full governance event chain. It's not "we logged what happened." It's "we can prove which software version, running which config, produced which sequence of events." orkia verify checks it, orkia check gates your CI pipeline.

Sensitivity labels are monotone by construction. LabelSet wraps BTreeSet<DataLabel> and exposes insert/union but literally has no remove/clear method. Once data is classified, it stays classified. You can't break this property because the API won't let you compile code that tries.

MCP tool injection scanner. External MCP servers can embed prompt injections in tool descriptions (the text goes straight into the LLM system prompt). Orkia scans tool definitions for instruction overrides, exfiltration patterns, and zero-width characters before they're registered.

The loop guard has 6 detection layers running before policy evaluation: circuit breaker, outcome-aware dedup (same tool + same params + same result = faster escalation), ping-pong pattern detection (A-B-A-B cycles), proportional dominance (one tool consuming >80% of calls), per-tool rate limits, and warning escalation.

The architecture doc (ARCHITECTURE.md) goes deep on every design decision if you want to poke holes. Would love feedback, especially from people building agent systems in production or anyone who thinks the fail-closed default is wrong.

Ask HN: How do solo founders find academic co-founders for STTR grants?

Would You Buy Generic AI?

Show HN: Arbor – AI research workbench, question to knowledge graph

PEP 827 – Type Manipulation

Regenerator 2000: interactive disassembler for the C64 and other 6502 systems

CEOs are betting big on AI while barely using it

The AI Bubble Is an Information War

Google violates its 14-day deprecation policy for Gemini 3 Pro Preview

US Stock Market has lost $1 TRILLION in value since open Tuesday

A lightweight, embeddable Prolog interpreter written in C11

Blackberry Growth Monitoring and Feature Quantification with UAV Remote Sensing

The Court's (Selective) Impatience Is a Vice

Show HN: Boosted LightFace – A Hybrid DNN and GBM Model for Facial Recognition

Isn't P2P WebRTC better than SSH for connecting to Mac terminal from iPhone?

Anthropic's Claude sees 'elevated errors' as it tops Apple's free apps

Bio-Inspired Adapters: Improving Models Beyond LoRA Fine-Tuning

Show HN: Design Jam, ASCII wireframes and annotations that export as AI prompts

Show HN: Free Math Sheets – Generate math worksheets for K-5 problems

What the First Billionaire Reveals About the First Trillionaire

A New Rembrandt Discovered

What AI-justified mass layoffs reveal about what we were never owed

Show HN: I rewrote an inventory app 4 times over 5 years before releasing v1

Floyd is an enterprise-level world model

Walk me through this "Safety Third" thing

Perplexity Computer Is Groundbreaking

Jack Dorsey Blamed AI for Block's Layoffs. Skeptics Aren't Buying It

A new 'uncertainty relation' for quantum measurement errors

Building an Elite AI Engineering Culture in 2026

Idaho considers an 'apocalyptic' choice for disabled people and families

Where AI Agents Are Heading: What We Learned from Recent YC Startups