frontpage.

Show HN: Open-sourced AI Agent runtime (YAML-first)

https://github.com/NikoSokratous/agentctl

1•nsokra02•2h ago

Been running AI agents in production for a while and kept running into the same issues:

controlling what they can do tracking costs debugging failures making it safe for real workloads

So we built AgentRuntime, the infrastructure layer we wished we had. Not an agent framework, but the platform around agents:

policies memory workflows observability cost tracking RAG governance

Agents and policies are defined in YAML, so it's infrastructure-as-code rather than a chatbot builder. Example – agents and policies in YAML agent.yaml – declarative agent config name: support_agent

model: provider: anthropic name: claude-3-5-sonnet

context_assembly: enabled: true

  embeddings:
    provider: openai
    model: text-embedding-3-small

  providers:
    - type: knowledge
      config:
        sources: ["./docs"]
        top_k: 3

policies/safety.yaml – governance as code name: security-policy

rules: - id: block-file-deletion condition: tool.name == "file_delete" action: deny

CLI – run and inspect Create and run an agent agentctl agent create researcher --goal "Research AI safety" --llm gpt-4 agentctl agent run researcher agentctl runs watch <run-id>

Manage policies agentctl policy list agentctl policy activate security-policy 1.0.0

RAG – ingest docs and ground responses in your knowledge base agentctl context ingest ./docs agentctl run --agent agent.yaml --goal "How do I deploy?"

Agent-level debugging agentctl debug -c agent.yaml -g "Analyze this dataset."

Cost tracking is exposed via the API (per agent/tenant), and the Web UI shows analytics. The workflow debugger (breakpoints, step-through) lives in the pkg layer; the CLI debug is for agent execution. What’s in there Governance

Policy engine (CEL) Risk scoring Encrypted audit logs RBAC Multi-tenancy Fully YAML-configurable

Orchestration

Visual workflow designer (React Flow) DAG workflows Multi-agent coordination Conditional logic Plugin hot-reload Workflow marketplace

Memory & Context

Working memory Persistent memory Semantic memory Event log

Context assembly combines:

policies workflow state memory tool outputs knowledge

RAG features:

embeddings (OpenAI or local) SQLite for development Postgres + vector stores in production

Observability

Cost attribution via API SLA monitoring Distributed tracing (OpenTelemetry) Prometheus metrics Deterministic replay (5 modes)

Production

Kubernetes operator (Agent, Workflow, Policy CRDs) Helm charts Istio config Auto-scaling Backup / restore GraphQL + REST API

Implementation

~50k LOC of Go Hundreds of tests Built for production (in mind)

Runs on: Local

SQLite In-memory runtime

Production

Postgres Redis Qdrant / Weaviate

Happy to answer questions or help people get started

Deploy from GitHub Actions Without Storing Secrets (Using OIDC)

Two Months as a Vibe Coder

Finance in the Dark

I made the first eSIM service for OpenClaw

The 10x inference tax you don't have to pay

Show HN: Aside – Local meeting capture with vault-native AI distillation

Package Management Is Naming All the Way Down

Plasma Bigscreen – 10-foot interface for KDE plasma

Darwinian Post-Design

Qatar Shuts World’s Largest LNG Export Plant

Show HN: Kai – macOS native fully autonomous AI agent.

Invite people over to your home regularly

Show HN: Potatoverse, home for your vibecoded apps

Migrating Elderly Care AI from Qwen 3 to 3.5 on Apple Silicon – 14x Latency Fix

Show HN: AI Home Design – Room redesign and video tour and 3D panorama

Jacinda Arderns move to Australia spotlight on New Zealand's brain drain problem

Iran Just Triggered a Cryptic Shortwave Message [video][8m41s]

We may soon have 70M boomers too old to drive, too car-dependent to stop

Are AI Datacenters Increasing Electric Bills for American Households?

Tell HN: Gemini 3.1 Pro may be responding to other users' prompts

Show HN: ScopeGate – Granular permission gateway for AI agents (MCP, open-core)

What Happened to Agent Frameworks?

Show HN: PTEcorepractice – Free PTE Core Practice with Instant AI Scoring for PR

Some Tech Publications Consistently Score Products 5–7 Points Higher Than Others

Anthropic Made Pitch in Drone Swarm Contest During Pentagon Feud

Show HN: Preventing infinite agent loops with mathematical convergence gates

Wall Street Killed the Wildcatters: $100 Oil Now Means Bigger Buybacks

Moving to Sweden as an American

Apple Does Value (Week)

Building a FOSS live streaming camera