frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

DoNotNotify is now Open Source

https://donotnotify.com/opensource.html
213•awaaz•4h ago•35 comments

Dave Farber has passed away

https://lists.nanog.org/archives/list/nanog@lists.nanog.org/thread/TSNPJVFH4DKLINIKSMRIIVNHDG5XKJCM/
23•vitplister•54m ago•4 comments

Why E cores make Apple Silicon fast

https://eclecticlight.co/2026/02/08/last-week-on-my-mac-why-e-cores-make-apple-silicon-fast/
10•ingve•1h ago•1 comments

Matchlock: Linux-based sandboxing for AI agents

https://github.com/jingkaihe/matchlock
44•jingkai_he•4h ago•8 comments

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

https://github.com/localgpt-app/localgpt
249•yi_wang•11h ago•122 comments

Reverse Engineering Raiders of the Lost Ark for the Atari 2600

https://github.com/joshuanwalker/Raiders2600
16•pacod•3h ago•1 comments

Haskell for all: Beyond agentic coding

https://haskellforall.com/2026/02/beyond-agentic-coding
148•RebelPotato•10h ago•43 comments

(AI) Slop Terrifies Me

https://ezhik.jp/ai-slop-terrifies-me/
39•Ezhik•2h ago•23 comments

SectorC: A C Compiler in 512 bytes (2023)

https://xorvoid.com/sectorc.html
325•valyala•18h ago•66 comments

LLMs as the new high level language

https://federicopereiro.com/llm-high/
138•swah•5d ago•259 comments

Rabbit Ear "Origami": programmable origami in the browser (JS)

https://rabbitear.org/book/origami.html
16•molszanski•3d ago•3 comments

The Architecture of Open Source Applications (Volume 1) Berkeley DB

https://aosabook.org/en/v1/bdb.html
47•grep_it•5d ago•8 comments

Software factories and the agentic moment

https://factory.strongdm.ai/
243•mellosouls•21h ago•404 comments

Curating a Show on My Ineffable Mother, Ursula K. Le Guin

https://hyperallergic.com/curating-a-show-on-my-ineffable-mother-ursula-k-le-guin/
4•bryanrasmussen•2h ago•0 comments

Modern and Antique Technologies Reveal a Dynamic Cosmos

https://www.quantamagazine.org/how-modern-and-antique-technologies-reveal-a-dynamic-cosmos-20260202/
11•sohkamyung•5d ago•0 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode
196•surprisetalk•18h ago•202 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
199•AlexeyBrin•1d ago•36 comments

uLauncher

https://github.com/jrpie/launcher
41•dtj1123•5d ago•10 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
216•vinhnx•21h ago•26 comments

The Legacy of Daniel Kahneman: A Personal View (2025)

https://ejpe.org/journal/article/view/1075/753
8•cainxinth•3d ago•0 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
378•jesperordrup•1d ago•121 comments

Brookhaven Lab's RHIC concludes 25-year run with final collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
86•gnufx•17h ago•66 comments

LineageOS 23.2

https://lineageos.org/Changelog-31/
92•pentagrama•6h ago•25 comments

Wood Gas Vehicles: Firewood in the Fuel Tank (2010)

https://solar.lowtechmagazine.com/2010/01/wood-gas-vehicles-firewood-in-the-fuel-tank/
59•Rygian•3d ago•29 comments

In the Australian outback, we're listening for nuclear tests

https://www.abc.net.au/news/2026-02-08/australian-outback-nuclear-tests-listening-warramunga-faci...
20•defrost•2h ago•4 comments

First Proof

https://arxiv.org/abs/2602.05192
158•samasblack•21h ago•97 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
119•momciloo•18h ago•29 comments

Start all of your commands with a comma (2009)

https://rhodesmill.org/brandon/2009/commands-with-comma/
623•theblazehen•3d ago•224 comments

Substack confirms data breach affects users’ email addresses and phone numbers

https://techcrunch.com/2026/02/05/substack-confirms-data-breach-affecting-email-addresses-and-pho...
86•witnessme•7h ago•38 comments

Al Lowe on model trains, funny deaths and working with Disney

https://spillhistorie.no/2026/02/06/interview-with-sierra-veteran-al-lowe/
114•thelok•20h ago•28 comments
Open in hackernews

GPU memory snapshots: sub-second startup (2025)

https://modal.com/blog/gpu-mem-snapshots
27•jxmorris12•1mo ago

Comments

erwaen98•4w ago
Looks great
erichocean•4w ago
Tried it out, first curl after deploy gave me a 303, but second attempt worked.
Imustaskforhelp•4w ago
Is modal running every single service inside gvisor?

I have heard that gvisor isn't recommended to run every single production but rather only some front facing or some other activities but it has some serious performance degradation which is why most end up using firecracker

This is really cool though, does this mean that we could probably have AI models that are snapshotted?

Are the states of checkpoint/recovery encrypted by default or how would that even work? Like what are the privacy aspects of it. I don't think even using something like modal would be the private llm that many people sometimes want on subreddits like localllama but the people dont have gpu. of course nothing beats privacy if you have your own gpu's but I'd be curious to know what people's thoughts are

markasoftware•4w ago
the thing is modal is running untrusted containers, so there's not really a concept of "some front facing" containers. Any container running an untrusted workload is at high risk / is "front facing".

If Modal's customers' workloads are mainly GPU-bound, then the performance hit of gvisor isn't as big as it might be for other workloads. GPU activity does have to go through the fairly heavyweight nvproxy to be executed on the host, but most gpu activity is longer-lived async calls like running kernels so a bit of overhead in starting / retrieving the results from those calls can be tolerated.

Imustaskforhelp•4w ago
Well if someone is gonna use Modal exactly for GPU purposes then I guess its okay but anything compute related just feels like it would have some issues performance wise

So I can agree that perhaps Modal might make sense for LLM's but they position themselves as sandbox including something like running python code etc. and some of this may be more intensive in workflows than others so I just wanted to point it out

Fly.io uses firecracker so I kinda like firecracker related applications (I tried to run firecracker myself its way too hard to build your own firecracker based provider or anything) and they recently released https://sprites.dev/

E2B is another well known solution out there. I have talked to their developers once and they mentioned that they run it on top of gcp

I am really interested in kata containers as well because I think kata runs on top of firecracker and can hook with docker rather quickly.

amitprasad•4w ago
If you're not looking for GPU snapshotting the ecosystem is relatively mature. Specifically, CPU-only VM-based snapshotting techniques are pretty well understood. However, if you need GPUs, this is a notoriously hard problem. IIRC Fly also was planning on using gVisor (EDIT: cloud-hypervisor) for their GPU cloud, but abandoned the effort [1].

Kata runs atop many things, but is a little awkward because it creates a "pod" (VM) inside which it creates 1+ containers (runc/gVisor). Firecracker is also awkward because GPU support is pretty hard / impossible.

[1] https://fly.io/blog/wrong-about-gpu/

Imustaskforhelp•3w ago
Ohh this makes sense now. Firecracker is good for compute related workflows but gvisor is more good for GPU related workflows, gotcha.

For my use cases usually, its Firecracker but I can now see why company like Modal would use gvisor because they focus a lot (and I mean a lot) on providing gpu access. I think that its one of their largest selling points or one of them, for them compute is secondary customer and gvisor's compute performance hit is a well worth trade off for them

Thanks for trying to explain the situation!

zackangelo•4w ago
This uses Nvidia’s CUDA snapshot API under the hood, but you have to pair it with a host side snapshot as well. Modal uses gVisor for this, which is notoriously high overhead.

Does anyone know of a more efficient alternative if you’re running a trusted container?

luiscape•3w ago
Post author here: there are other projects that will create a proxy for CUDA calls and use the log of CUDA operations to checkpoint / restore or live migration tasks. We haven’t used them. I don’t believe they are very popular nor used outside specific orgs.

This is the only API available for snapshotting NVIDIA GPU memory, afaik.

As for needing to combine it with a host memory snapshot step, this is required because CUDA sessions need to be mapped to a host process, so you need to snapshot both things in order for the program to be restored correctly.

CRIU is another project that uses the same technique (CUDA snapshot + host memory snapshot). Different than CRIU, our snapshots work at the function level so we’re able to take snapshots after functions have been initialized (including GPU memory), making Modal cold boots fast. One would have to implement this entire process using CRIU.

vivzkestrel•4w ago
- as a guy not familiar or in loop with all these sandbox products, i have a quick question for anyone reading this

- what is the difference between docker and modal?

- what does modal do that docker doesnt?

- what is the cold start time comparison between both?

- how do both of these differ from something called "Firecracker VM"?

BobbyTables2•4w ago
I can describe firecracker.

With Intel VMX virtualization, instruction execution is handled by the CPU but (a lot) of software still has to deal with HW peripheral emulation .

QEMU uses KVM (Intel VMX, etc) but implements HW peripherals (display, network, disk, etc) faithfully matching really HW and provides a full BIOS (SeaBios) or UEFI firmware (EDK) to deal with with boot process.

Over time, Linux (and Windows) were extended to support novel “peripherals” designed for high emulation performance (not a real HW product).

Firecracker basically skips all the “real” peripheral emulation and skips the full BIOS/UEFI firmware. Instead, it implements just enough to boot modern Linux directly. Also written in Rust instead of C. It will never support DOS, Windows 95 or probably anything else.

The “microVM” BIOS allows it to start booting Linux very quickly (sub-second). A traditional QEMU VM might take 2-5 seconds. Some people are emboldened to effectively move back from containers to running applications in a VM…

Instead of the VM being long lived, it is really just for running a single app.

I think Kata containers had this idea for much longer but Firecracker provides a more efficient implementation for such a thing.

vivzkestrel•4w ago
thank you very much for the detail there. I assume you would also know very well how a docker container would compare to firecracker in terms of boot time. I understand that a container and a VM are not the same thing but just curious
BobbyTables2•3w ago
The overhead to starting a docker container is practically zero. A new namespace and a few overlayfs mounts are virtually instantaneous.

Roughly speaking, once the kernel has booted inside a VM, it launches the first process which would be the “container” for a “firecracker container”.

Certainly possible to get kernel boot times below 1 second.