frontpage.

I built thaw because forking an LLM agent is absurdly wasteful today. When an agent explores N branches — RL rollouts, best-of-N, parallel coding attempts — each branch re-runs prefill over the same shared context. You pay for the same prompt N times.

thaw snapshots a live inference session — weights, KV cache, scheduler state, and the prefix-hash table — and hydrates N children that diverge from the fork point without re-prefilling. It's `git branch` for a running model.

The receipt (H100 80GB, Llama-3.1-8B, real hardware): a pre-warmed pool boots once in 22.3s, then each fork round of 4 branches × 64 tokens runs in 0.88s median. Cold-boot equivalent would be ~340s/round — ~400× amortized. All rounds bit-identical at the fork boundary. Full JSON receipt + reproducer in the repo, nothing hand-waved.

NVIDIA shipped Dynamo Snapshot last week for fast pod cold-starts — and they free the KV cache before checkpoint, by design. thaw is the opposite bet: preserve the KV cache so a fork is near-free. Different problem, opposite mechanic.

pip install thaw-vllm. Works with vLLM and SGLang, Apache-2.0.

https://github.com/thaw-ai/thaw

I'm a solo dev and this is the thing I most want feedback on: is the fork primitive the right shape, or do people want it wrapped in a framework(LangGraph/TRL) node instead? Happy to go deep on the KV-restore internals.

Is Huawei's new chip scaling law a true breakthrough, or mere hype

Cheese Paper: a text editor specifically designed for writing

Show HN: Kanji Pairs Explorer

Building a custom mount for a telescoping webcam

Meta urges Labour to burden Apple with age checks

An Elephant Who Demonstrated That Her Species Might Be Self-Aware, Dies at 55

Arch-Decision – A multi-agent architecture tool for Claude Code

Mapping how the brain takes out its trash

Short-lived certificates: a nuisance or an automation opportunity?

Professional Sports Are Banning Smart Glasses over Betting Concerns

A new way to build chips: Sequentially stacking silicon to extend Moore's Law

Body Keeps the Score – The Gut-Brain Connection Nobody Told You About

Ask HN: Students, What Impact Is AI Having on Your Education?

SEC Commissioner Peirce defends crypto privacy tools against surveillance push

The Journal of Hendrick Hamel (1668)

Soon, Nearly a Third of Americans Will Live in States with Legal Aid in Dying

Citadel loses challenge to SEC approval of new options exchange

Starbucks Abandons Borked AI Inventory Tool That Couldn't Count

Two abandoned Soviet space shuttles left in the Kazakh steppe (2017)

China's Rise in Drug Development Looms over U.S.

Tony Gilroy, Andor creator doesn't want his work to become training data

DeepSWE: More and cheaper intelligence from maxed GPT 5.5 than maxed Opus 4.8

After more than two decades Paint.NET finally owns the domain paint.net

What Makes an Exceptional Engineer?

A UX Focused Guide to Building a Linux Distro for Normies

Show HN: Thaw – Git branch for a running LLM (fork agents, skip prefill)

Shantell Sans

Show HN: Babo – A scripting natural language that works as intended

We Benchmarked Our Open Source Memory Tool Against a Microsoft Research Paper

Show HN: HN Station – A local-first HN desktop client with split-pane reading

Show HN: Thaw – Git branch for a running LLM (fork agents, skip prefill)

Is Huawei's new chip scaling law a true breakthrough, or mere hype

Cheese Paper: a text editor specifically designed for writing

Show HN: Kanji Pairs Explorer

Building a custom mount for a telescoping webcam

Meta urges Labour to burden Apple with age checks

An Elephant Who Demonstrated That Her Species Might Be Self-Aware, Dies at 55

Arch-Decision – A multi-agent architecture tool for Claude Code

Mapping how the brain takes out its trash

Short-lived certificates: a nuisance or an automation opportunity?

Professional Sports Are Banning Smart Glasses over Betting Concerns

A new way to build chips: Sequentially stacking silicon to extend Moore's Law

Body Keeps the Score – The Gut-Brain Connection Nobody Told You About

Ask HN: Students, What Impact Is AI Having on Your Education?

SEC Commissioner Peirce defends crypto privacy tools against surveillance push

The Journal of Hendrick Hamel (1668)

Soon, Nearly a Third of Americans Will Live in States with Legal Aid in Dying

Citadel loses challenge to SEC approval of new options exchange

Starbucks Abandons Borked AI Inventory Tool That Couldn't Count

Two abandoned Soviet space shuttles left in the Kazakh steppe (2017)

China's Rise in Drug Development Looms over U.S.

Tony Gilroy, Andor creator doesn't want his work to become training data

DeepSWE: More and cheaper intelligence from maxed GPT 5.5 than maxed Opus 4.8

After more than two decades Paint.NET finally owns the domain paint.net

What Makes an Exceptional Engineer?

A UX Focused Guide to Building a Linux Distro for Normies

Show HN: Thaw – Git branch for a running LLM (fork agents, skip prefill)

Shantell Sans

Show HN: Babo – A scripting natural language that works as intended

We Benchmarked Our Open Source Memory Tool Against a Microsoft Research Paper

Show HN: HN Station – A local-first HN desktop client with split-pane reading