frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Orkia – a Rust runtime where AI agents can't bypass governance

https://github.com/orkiaHQ/orkia
1•killix•1h ago
Orkia is an open-source Rust runtime for LLM agents where policy enforcement, trust scoring, and audit trails are wired into the execution loop at the type-system level.

No code path exists that executes a tool without passing through governance. Fail-closed by default, signed session evidence (ECDSA P-256), and agents that earn autonomy through demonstrated behavior.

Apache 2.0.

Comments

killix•1h ago
Hey HN, author here. Some context on why I built this and what's interesting technically.

I was deploying LLM agents for business processes and kept hitting the same problem: every agent framework defaults to "allow everything." No policy configured? All tools available. No audit? Hope your logs are enough. No trust model? Same permissions on day one as day one thousand.

Orkia flips every default.

Fail-closed by default. No policy rule matching a tool call = denied. Not "allowed until someone writes a deny rule." This is the opposite of how most frameworks work, and it's the single decision that shapes everything else.

Trust earned, not granted. Agents start restricted and gain autonomy through behavior. ATLAS tracks 4 dimensions (task completion, policy compliance, resource usage, audit completeness) and computes an autonomy level. The key insight: trust scores are keyed on SHA-256 of the canonical agent config. Change the model, tools, or instructions, trust resets to zero. No stale trust carries over.

Signed evidence, not logs. Every session produces a SEAL artifact, an ECDSA P-256 signature binding the runtime binary hash + config fingerprint + full governance event chain. It's not "we logged what happened." It's "we can prove which software version, running which config, produced which sequence of events." orkia verify checks it, orkia check gates your CI pipeline.

Sensitivity labels are monotone by construction. LabelSet wraps BTreeSet<DataLabel> and exposes insert/union but literally has no remove/clear method. Once data is classified, it stays classified. You can't break this property because the API won't let you compile code that tries.

MCP tool injection scanner. External MCP servers can embed prompt injections in tool descriptions (the text goes straight into the LLM system prompt). Orkia scans tool definitions for instruction overrides, exfiltration patterns, and zero-width characters before they're registered.

The loop guard has 6 detection layers running before policy evaluation: circuit breaker, outcome-aware dedup (same tool + same params + same result = faster escalation), ping-pong pattern detection (A-B-A-B cycles), proportional dominance (one tool consuming >80% of calls), per-tool rate limits, and warning escalation.

The architecture doc (ARCHITECTURE.md) goes deep on every design decision if you want to poke holes. Would love feedback, especially from people building agent systems in production or anyone who thinks the fail-closed default is wrong.

Ask HN: How do solo founders find academic co-founders for STTR grants?

1•Rao_Atreya•14s ago•0 comments

Would You Buy Generic AI?

https://tomtunguz.com/white-label-ai/
1•swolpers•28s ago•0 comments

Show HN: Arbor – AI research workbench, question to knowledge graph

https://www.arborinquiries.com/
1•FlynnLachendro•1m ago•0 comments

PEP 827 – Type Manipulation

https://peps.python.org/pep-0827/
1•pboulos•2m ago•0 comments

Regenerator 2000: interactive disassembler for the C64 and other 6502 systems

https://regenerator2000.readthedocs.io/en/latest/
2•homarp•2m ago•1 comments

CEOs are betting big on AI while barely using it

https://www.charterworks.com/ceos-are-betting-big-on-ai-while-barely-using-it/
2•swolpers•2m ago•0 comments

The AI Bubble Is an Information War

https://www.wheresyoured.at/the-ai-bubble-is-an-information-war/
2•spking•4m ago•0 comments

Google violates its 14-day deprecation policy for Gemini 3 Pro Preview

2•goolulusaurs•5m ago•0 comments

US Stock Market has lost $1 TRILLION in value since open Tuesday

https://old.reddit.com/r/StockMarket/comments/1rjtww8
1•ck2•6m ago•0 comments

A lightweight, embeddable Prolog interpreter written in C11

https://github.com/no382001/prolog
1•triska•8m ago•0 comments

Blackberry Growth Monitoring and Feature Quantification with UAV Remote Sensing

https://www.mdpi.com/2624-7402/6/4/260
1•PaulHoule•9m ago•0 comments

The Court's (Selective) Impatience Is a Vice

https://www.stevevladeck.com/p/214-the-courts-selective-impatience
1•hn_acker•9m ago•1 comments

Show HN: Boosted LightFace – A Hybrid DNN and GBM Model for Facial Recognition

https://dergipark.org.tr/en/pub/gujs/article/1794891
1•serengil•9m ago•0 comments

Isn't P2P WebRTC better than SSH for connecting to Mac terminal from iPhone?

https://macky.dev/#architecture
1•eureka_boy•9m ago•2 comments

Anthropic's Claude sees 'elevated errors' as it tops Apple's free apps

https://www.cnbc.com/2026/03/02/anthropic-claude-ai-outage-apple-pentagon.html
1•LostMyLogin•9m ago•1 comments

Bio-Inspired Adapters: Improving Models Beyond LoRA Fine-Tuning

https://www.genbais.com/
1•lazarko•11m ago•0 comments

Show HN: Design Jam, ASCII wireframes and annotations that export as AI prompts

https://getdesignjam.com
1•Adrig•11m ago•0 comments

Show HN: Free Math Sheets – Generate math worksheets for K-5 problems

https://www.freemathsheets.com/
1•mchaver•12m ago•0 comments

What the First Billionaire Reveals About the First Trillionaire

https://www.bloomberg.com/news/features/2026-02-26/elon-musk-and-the-first-trillionaire-what-rock...
3•robtherobber•12m ago•0 comments

A New Rembrandt Discovered

https://www.rijksmuseum.nl/en/stories/themes/rembrandt/story/a-new-rembrandt-discovered
1•Tomte•12m ago•0 comments

What AI-justified mass layoffs reveal about what we were never owed

https://codeplusconduct.substack.com/p/grateful-for-your-contributions
1•mooreds•13m ago•0 comments

Show HN: I rewrote an inventory app 4 times over 5 years before releasing v1

https://upzonehq.com/
1•florentmsl•13m ago•0 comments

Floyd is an enterprise-level world model

https://www.loom.com/share/7b3ba36113e446548f3a79cf5fc1e42c
1•tjarzu•15m ago•0 comments

Walk me through this "Safety Third" thing

https://mikerowe.com/2020/03/walk-me-through-this-safety-third-thing/
2•andsoitis•15m ago•0 comments

Perplexity Computer Is Groundbreaking

https://karozieminski.substack.com/p/perplexity-computer-review-examples-guide
2•Lunaboo•18m ago•0 comments

Jack Dorsey Blamed AI for Block's Layoffs. Skeptics Aren't Buying It

https://www.wsj.com/business/jack-dorseys-latest-far-out-bet-an-ai-future-with-fewer-employees-25...
2•nradov•18m ago•0 comments

A new 'uncertainty relation' for quantum measurement errors

https://phys.org/news/2026-03-uncertainty-quantum-errors.html
2•bikenaga•18m ago•1 comments

Building an Elite AI Engineering Culture in 2026

https://www.cjroth.com/blog/2026-02-18-building-an-elite-engineering-culture
1•mooreds•18m ago•0 comments

Idaho considers an 'apocalyptic' choice for disabled people and families

https://19thnews.org/2026/03/idaho-medicaid-budget-cuts-disability-programs/
1•mooreds•18m ago•0 comments

Where AI Agents Are Heading: What We Learned from Recent YC Startups

https://e2b.dev/blog/yc-companies-ai-agents
1•tizkovatereza•22m ago•2 comments