frontpage.

Hey HN,

Herd is a zero-dependency Go library that manages fleets of OS subprocesses and routes HTTP traffic to them with strict 1:1 session affinity.

If you put heavy, stateful binaries (like Ollama, headless Chromium, or Python REPLs) behind a standard reverse proxy and get a spike in traffic, it usually ends badly. You either trigger a massive CUDA/Metal context storm that OOM-kills the host machine, or you bleed state across different users' sessions.

Herd handles this without needing a heavy control plane like Kubernetes StatefulSets or Firecracker. It gives you automatic process lifecycle management and a built-in reverse proxy in about 10 lines of Go.

How it works under the hood:

- It spawns OS-level subprocesses via exec.Cmd.

- It routes incoming HTTP traffic based on any custom Session ID you define (a header, a cookie, a path parameter).

- If a session exists, it routes to that exact pinned OS process.

- If it doesn't, it safely acquires a singleflight lock, spawns a new process, waits for the /health endpoint, and proxies the request.

- If a process crashes, the blast radius is contained to one session, and the pool auto-recovers.

To test the concurrency constraints, I hurled 200 concurrent LLM inference requests at a Herd gateway backed by a pool capped at 10 Ollama (Qwen3:0.6B) workers on an M4 Pro Mac. It scored 200/200 with zero dropped packets, acting as a perfect backpressure queue to safely drip-feed the OS without thrashing the host's Unified Memory.

It’s MIT licensed. Would love for you to check out the repo, try to break the singleflight lock, or review the architecture.

Repo: https://github.com/HackStrix/herd

Architecture & Mermaid Diagrams: https://github.com/HackStrix/herd/blob/main/docs/ARCHITECTUR...

Werner Herzog Between Fact and Fiction

Show HN: Kiorg – a battery included file manager for keyboard nerds

Turn Your Handwriting into a Font

D-Illusion

You Can Use Stories to Hack the Human Brain

Show HN: Iceberg Map

High-performance Go web framework; Ships with OpenTelemetry, OpenAPI docs

Show HN: Hosted OpenClaw – 60s setup, no Mac Mini, $99 lifetime BYOK

Why developers using AI are working longer hours

Trump administration rolls back payday loan protections, affects youth (2019)

One in three using AI for emotional support and conversation, UK says

Dutch gov't pulls report on dangers of American cloud service after criticism

Agile legged locomotion in reconfigurable modular robots

Anthropic mapped out jobs AI replaces. Great Recession for white-collar workers

A new clue to how the body detects physical force

How to run Qwen 3.5 locally

Cost of physical therapy varies widely from state to state: study

The Death of the Cheap Laptop Is Coming

Philosopher of the Apocalypse

Sunsetting the 512kb Club

Put the zipcode first

Nix is a lie, and that's ok

Show HN: PolicyCortex – AI agent that autonomously remediates cloud misconfigs

OpenAI GPT-5.4 Explained

Technological Folie à Deux

Show HN: Beam Protocol – SMTP for AI Agents (natural language agent-to-agent)

Nauticuvs – pure-Rust curvelet transform for SAR sonar, by a self-taught dev

A subreddit for people who believe in AI sentience

Grow Fast and Overload Things

When ChatGPT is gone: Creativity reverts and homogeneity persists (2024)

Show HN: Herd – Session-affine process pool for Go