Reduce GVisor Cold Starts with GPU Snapshotting

https://cerebrium.ai/blog/reducing-gpu-cold-starts-with-memory-snapshots-restoring-cuda-workloads-in-second
26•jono_irwin•1h ago

Comments

nixosbestos•1h ago
Started scrolling, immediately closed the page. Something is deeply wrong with a person who chooses to implement this shit on a webpage. Unusable garbage, I'm sorry, literally making me motion sick somehow.
htrp•58m ago
Isn't this exactly what Modal does?
za_mike157•44m ago
Hey! Yes, you are correct! We have both been upstreaming changes to the main gVisor repo. However, to work within our own infrastructure we had to make various changes that we explain throughout the article (open TCP connections, multiprocessing, Unix sockets, etc.).

Also, in our benchmarks we seem to perform ~20% better than Modal in 4 of the 6 workloads we tested, with a lower spread of results, meaning you get more consistent results. However, the same fundamental question still applies: how can you move data from storage into memory as quickly as possible?

gpgn_•42m ago
Interesting work. How does NVIDIA Dynamo Snapshot relate?
za_mike157•15m ago
There are a lot of similarities.

They run their snapshot agent as a Kubernetes DaemonSet, whereas our implementation runs as part of the Cerebrium container runtime path. Under the hood, both approaches rely on cuda-checkpoint, since cuda-checkpoint is currently the main primitive NVIDIA exposes for interacting with GPU memory during checkpoint/restore.

One difference is how KV cache handling is exposed. NVIDIA’s approach appears to automatically handle KV cache allocation/deallocation, whereas today we expose that choice to users (vLLM and SGLang expose primitives to do this). In some cases, users may want to discard the KV cache to reduce checkpoint size and restore time; in others, preserving it may be useful.

Their DaemonSet approach is also nice because it can be more portable across Kubernetes environments and clouds. Our approach is more deeply integrated into the node/runtime layer, which gives us tighter control over the serverless startup path, but also means it depends on custom node VM images, which not every provider supports equally.

The optimizations they mention around parallel memfd restore and Linux native AIO for anonymous memory could also be applied to our architecture if we find them stable and beneficial. That said, our current results are already pretty close. For example, they report restoring Qwen3-8B in 4.7s with those changes, while we currently restore it in 6.49s.

The biggest thing we are excited for is multi-GPU restore, which is not supported yet. That would unlock a much broader set of workloads.
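
For readers who want the gap between the two Qwen3-8B restore times quoted above as a number, here is a minimal arithmetic sketch. The only inputs are the 4.7s and 6.49s figures from the comment; the helper name `restore_gap` is made up for illustration, and the two numbers may not come from identical hardware or test conditions.

```python
def restore_gap(baseline_s: float, other_s: float) -> tuple[float, float]:
    """Return (absolute gap in seconds, gap as a fraction of the baseline)."""
    gap = other_s - baseline_s
    return gap, gap / baseline_s


# Restore times for Qwen3-8B as quoted in the comment above.
nvidia_s = 4.7
cerebrium_s = 6.49

gap_s, gap_frac = restore_gap(nvidia_s, cerebrium_s)
print(f"gap: {gap_s:.2f}s ({gap_frac:.0%} slower than the 4.7s figure)")
# prints: gap: 1.79s (38% slower than the 4.7s figure)
```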

mountainriver•31m ago
How does this compare to the CRIU work? Or does it use that under the hood?
za_mike157•9m ago
No, we don't use it. CRIU is used for normal checkpoint/restore of Linux processes. Since we run gVisor for container isolation, we use its checkpoint/restore support for the sandboxed process state.

Both approaches still need NVIDIA’s cuda-checkpoint for the GPU side, because CUDA/GPU memory and driver state are not something a normal process checkpointing tool can handle on its own.
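
To make the ordering described above concrete, here is a rough sketch of the CRIU-based path the question refers to. It is not the gVisor path the author uses: there, gVisor's own checkpoint/restore would stand in for the `criu` steps. The function names are hypothetical. The flags follow the example in NVIDIA's cuda-checkpoint README as best I recall, so treat them as assumptions and check them against your installed versions. The code only builds the command lines and does not run anything.

```python
import shlex


def checkpoint_commands(pid: int, images_dir: str) -> list[list[str]]:
    """GPU state is suspended first, then the CPU process is dumped."""
    return [
        # Assumed flag: toggles the CUDA state of the process (running -> checkpointed).
        ["cuda-checkpoint", "--toggle", "--pid", str(pid)],
        # A regular process checkpoint tool handles the rest of the process.
        ["criu", "dump", "--shell-job", "--images-dir", images_dir, "--tree", str(pid)],
    ]


def restore_commands(pid: int, images_dir: str) -> list[list[str]]:
    """Reverse order: restore the CPU process, then bring CUDA state back."""
    return [
        ["criu", "restore", "--shell-job", "--restore-detached", "--images-dir", images_dir],
        # Toggle again to move the CUDA state from checkpointed back to running.
        ["cuda-checkpoint", "--toggle", "--pid", str(pid)],
    ]


if __name__ == "__main__":
    # Example PID and directory are placeholders.
    for cmd in checkpoint_commands(1234, "snap") + restore_commands(1234, "snap"):
        print(shlex.join(cmd))
```

The point of the sketch is the ordering: the GPU side is checkpointed before the process dump and restored after the process restore, which matches the comment's point that the process-level tool cannot handle CUDA memory and driver state on its own.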

For first time, a cell built from scratch grows and divides

https://www.quantamagazine.org/for-the-first-time-a-cell-built-from-scratch-grows-and-divides-202...
341•defrost•3h ago•103 comments

Physical disc production ending in Jan 2028 for new games on PlayStation

https://blog.playstation.com/2026/07/01/physical-disc-production-ending-in-january-2028-for-new-g...
268•Tiberium•5h ago•326 comments

How We Made IPFS Content Publishing 10x Faster

https://probelab.io/blog/optimistic-provide/
55•dennis-tra•2h ago•6 comments

Box3D, an open source 3D physics engine

https://box2d.org/posts/2026/06/announcing-box3d/
231•makepanic•5h ago•41 comments

FFmpeg 9.1's new AAC encoder

https://hydrogenaudio.org/index.php/topic,129691.0.html
29•ledoge•3h ago•15 comments

Monetization Gateway

https://blog.cloudflare.com/monetization-gateway/
120•soheilpro•3h ago•50 comments

Ask HN: Who is hiring? (July 2026)

49•whoishiring•2h ago•70 comments

Internal Combustion Engine

https://ciechanow.ski/internal-combustion-engine/
115•StefanBatory•4h ago•9 comments

Fixing a kubelet memory leak in Kubernetes 1.36

https://heyoncall.com/blog/fixing-kubernetes-kubelet-memory-leak
22•compumike•15h ago•4 comments

Ask HN: Who wants to be hired? (July 2026)

36•whoishiring•2h ago•97 comments

Launch HN: Parsewise (YC P25) – Reason Across Documents with an API

29•gergelycsegzi•3h ago•24 comments

Reduce GVisor Cold Starts with GPU Snapshotting

https://cerebrium.ai/blog/reducing-gpu-cold-starts-with-memory-snapshots-restoring-cuda-workloads...
26•jono_irwin•1h ago•7 comments

Sony Deletes 551 Movies PlayStation Owners Paid For

https://reclaimthenet.org/sony-deletes-551-studiocanal-movies-playstation-owners-paid-for
218•bilsbie•3h ago•107 comments

Manufact (YC S25) Is Hiring a Developer Advocate in SF

https://www.ycombinator.com/companies/manufact/jobs/4cyWd6S-developer-advocate-partnerships-devrel
1•luigipederzani•4h ago

Red Programming Language: Static linking support

https://www.red-lang.org/2026/06/static-linking-support.html
52•em-bee•1d ago•7 comments

Asahi Linux 7.1 Progress Report

https://asahilinux.org/2026/06/progress-report-7-1/
447•pantalaimon•7h ago•153 comments

1-Bit Pixel Art Emojis

https://hypertalking.com/2023/05/15/1-bit-pixel-art-emojis/
56•surprisetalk•6d ago•9 comments

Solid and Clean Code never felt solid or clean to me

https://devz.cl/posts/solid-never-felt-solid/
15•DanielVZ•1h ago•11 comments

Because It Speaks in Words

https://brianschrader.com/archive/because-it-speaks-in-words/
5•sonicrocketman•2d ago•0 comments

Nintendo has raised its employees base salary by 10%

https://mynintendonews.com/2026/06/26/nintendo-has-raised-its-employees-base-salary-by-10/
391•_tk_•6h ago•206 comments

Show HN: Pglayers – PostgreSQL extensions as stackable Docker layers

https://github.com/pglayers/pglayers
4•iemejia•45m ago•0 comments

Newly discovered spider builds spring loaded snare to catch ants

https://phys.org/news/2026-06-newly-australian-ballista-spider-snare.html
205•chimpanzee•2d ago•45 comments

Apple 'Hide My Email' vulnerability reveals peoples' real email addresses

https://easyoptouts.com/guides/apple-hide-my-email-is-leaking-email-addresses
90•sashk•7h ago•8 comments

Ray Tracer in SQL

https://github.com/ClickHouse/RayTracer
28•kbumsik•3h ago•7 comments

Most arguments are about ego, not ideas

https://wangcong.org/2026-06-30-why-i-stopped-arguing-with-people.html
545•backlit4034•4h ago•423 comments

Department of Commerce has lifted export controls on Claude Fable 5 and Mythos 5

https://twitter.com/AnthropicAI/status/2072106151890809341
871•Pragmata•17h ago•574 comments

Show HN: GolemUI – The new paradigm for JavaScript forms

https://golemui.com
13•wtfdeveloper•2h ago•21 comments

ArXiv's Next Chapter

https://blog.arxiv.org/2026/06/30/arxivs-next-chapter/
247•subset•14h ago•79 comments

Show HN: PMB – local memory for coding agents that shows if it is used

https://pmbai.dev
7•oleksiibond•2d ago•3 comments

Obfuscation: Building the final boss of cryptography (Part I)

https://vitalik.eth.limo/general/2026/06/29/obfuscation1.html
70•fbrusch•2d ago•9 comments