Show HN: Checkpoint K8s pods transparently (plain CPU or GPU accelerated) [video]

https://www.youtube.com/watch?v=K9yY6_2255Y
2•qnib•3h ago
Hi HN,

I am an early adopter of containers with a background in HPC. From the early days, I’ve tried to merge container tech into the HPC (and HTC) stack. Containers already make packaging and deployment easier - especially in AI/ML and data science. How about checkpoint/restore?

Over the last couple of months, we at MemVerge have developed a Kubernetes Operator for transparent checkpointing and restoring, allowing you to use discounted Spot instances for long-running workloads, like bioinformatics workflows or ML training.

Here’s how it works:

- the operator attaches a PVC to your pod
- it intercepts the STOP signal to checkpoint the pod
- if the attached PVC contains a checkpoint when the pod is starting over, it will be restored instead of starting from scratch
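To make the control flow above concrete, here is a minimal application-level sketch in Python. It is not the operator's implementation: the operator checkpoints the whole process tree transparently, while this toy only saves a counter as JSON. The `Worker` class, the `checkpoint.json` file name, and the `interrupt_at` parameter are all hypothetical names made up for illustration, and a temporary directory stands in for the PVC mount.

```python
import json
import os
import signal
import tempfile


class Worker:
    """Toy long-running job that saves its progress when asked to stop."""

    def __init__(self, ckpt_dir):
        # ckpt_dir plays the role of the PVC mount point (hypothetical layout).
        self.path = os.path.join(ckpt_dir, "checkpoint.json")
        self.state = {"step": 0}
        self.stop_requested = False

    def install_handler(self):
        # By default Kubernetes sends SIGTERM to a container being stopped.
        signal.signal(signal.SIGTERM, self.handle_stop)

    def handle_stop(self, signum, frame):
        # Only set a flag here; the main loop writes the checkpoint.
        self.stop_requested = True

    def restore_if_present(self):
        # On start, resume from the checkpoint if one exists on the volume.
        if os.path.exists(self.path):
            with open(self.path) as f:
                self.state = json.load(f)
            return True
        return False

    def checkpoint(self):
        # Write to a temp file, then rename, so a crash never leaves a torn file.
        tmp = self.path + ".tmp"
        with open(tmp, "w") as f:
            json.dump(self.state, f)
        os.replace(tmp, self.path)

    def run(self, total_steps, interrupt_at=None):
        while self.state["step"] < total_steps and not self.stop_requested:
            self.state["step"] += 1  # stand-in for one unit of real work
            if interrupt_at is not None and self.state["step"] == interrupt_at:
                # Simulate an eviction by invoking the handler directly.
                self.handle_stop(signal.SIGTERM, None)
        if self.stop_requested:
            self.checkpoint()
        return self.state["step"]


if __name__ == "__main__":
    ckpt_dir = tempfile.mkdtemp()  # stands in for the attached PVC
    first = Worker(ckpt_dir)
    first.restore_if_present()
    print("first run stopped at step", first.run(10, interrupt_at=4))
    second = Worker(ckpt_dir)
    print("restored from checkpoint:", second.restore_if_present())
    print("second run finished at step", second.run(10))
```

In a real container you would call `install_handler()` once at startup so an actual SIGTERM sets the flag. The demo calls the handler directly only to keep the example deterministic.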

Here’s a 2m30s video that demonstrates interrupting a small training workload: https://youtu.be/K9yY6_2255Y

This can be triggered by someone draining the node (e.g., due to an EC2 Spot reclaim), deleting a pod, or another operator acting on its own logic. Our checkpoint engine captures every aspect of the process tree within the container: memory pages, file descriptors—even TCP connections, if you want us to. Until recently, it was targeted at CPU use cases only. We’ve now added support for NVIDIA GPUs, with AMD GPUs coming soon (via upstream CRIU plugins).

I’ve done some typical checkpoint/restore work (e.g., Jupyter notebooks, traditional jobs) and would love to hear what kinds of workloads you’re interested in checkpointing and restoring.

You can try it out in your Kubernetes environment with our 60-day trial: https://form.typeform.com/to/vZujMYxI

Language Transfer – The Thinking Method (free language courses)

https://www.languagetransfer.org
1•alexmorley•1m ago•0 comments

Geodesy for the Layman (1984)

https://alexanderbass.com/library/geodesy-for-layman/
2•froober•1m ago•0 comments

Show HN: StopAddict – Quit addictions with a clean, gamified tracker

1•skyzouw•1m ago•1 comments

GUI in Pure Rust

https://github.com/emilk/egui
1•vishnumohandas•3m ago•0 comments

Show HN: I Made an Extension That Makes You Money

https://www.mintcashback.com/
1•alexlekkas•6m ago•1 comments

Rust 1.88.0 hits stable with let-chains support

https://releases.rs/docs/1.88.0/
1•wmstack•7m ago•0 comments

You Don't Own the Word "Freedom"

https://fireborn.mataroa.blog/blog/you-dont-own-the-word-freedom-a-full-burn-response-to-the-gnulinux-comment-that-tried-to-gatekeep-me-off-my-own-machine/
1•DHowett•12m ago•0 comments

Show HN: Zenta – Mindfulness for Terminal Users

https://github.com/e6a5/zenta
2•ihiep•13m ago•0 comments

Show HN: Zeptaframe – Open-source click-and-drag precision for AI video gen

https://github.com/Pablerdo/zeptaframe
2•Pablerdo•13m ago•0 comments

DeepSeek R2 launch stalled as CEO balks at progress

https://www.reuters.com/world/china/deepseek-r2-launch-stalled-ceo-balks-progress-information-reports-2025-06-26/
1•nsoonhui•14m ago•0 comments

Extending Anthropic's Agent Workflows with Recursive Planning

https://actamachina.com/posts/recursive-planning
1•tlyleung•16m ago•0 comments

Show HN: 10x Kubernetes Cluster on Hetzner Cloud

https://github.com/identiops/terraform-hcloud-k3s
2•jceb81•18m ago•0 comments

Get AI-powered command suggestions directly in your zsh shell

https://github.com/yetone/smart-suggestion
1•wey-gu•21m ago•1 comments

Apple reveals complex system of App Store fees to avoid E.U. fine of 500M euros

https://www.cnbc.com/2025/06/26/apple-eu-500-million-euro-app-store.html
1•arnon•30m ago•1 comments

Windows Resiliency Initiative: Building resilience for a future-ready enterprise

https://blogs.windows.com/windowsexperience/2025/06/26/the-windows-resiliency-initiative-building-resilience-for-a-future-ready-enterprise/
1•XzetaU8•34m ago•0 comments

Why Go Rocks for Building a Lua Interpreter

https://www.zombiezen.com/blog/2025/06/why-go-rocks-for-building-lua-interpreter/
3•thunderbong•40m ago•0 comments

Simplifying Vulkan Synchronization

https://www.khronos.org/blog/so-long-image-layouts-simplifying-vulkan-synchronisation
1•Bogdanp•49m ago•0 comments

Police identify seven as main suspects in Post Office Horizon scandal inquiry

https://www.theguardian.com/uk-news/2025/jun/27/police-identify-seven-as-main-suspects-in-post-office-horizon-scandal-inquiry
3•chrisjj•53m ago•0 comments

The 90% Gravity Problem: Why We Tend to Quit Right Before the Finish Line

2•darwinSir•55m ago•3 comments

Show HN: Tic-Tac-Toe in Pure CSS (No JavaScript/HTML)

https://lyra.horse/fun/tic-tac-nohtml/
3•rebane2001•56m ago•1 comments

How I Lost My Career and Started Delivering Mail

https://www.wsj.com/lifestyle/careers/how-i-lost-my-career-and-started-delivering-mail-c238d934
1•impish9208•56m ago•1 comments

Salesforce CEO Claims Half of the Company's Work Is Now Done by AI

https://gizmodo.com/salesforce-ceo-claims-half-of-the-companys-work-is-now-done-by-ai-2000620730
2•01-_-•58m ago•2 comments

An educational website for forex traders

https://www.fx-trading.space/
1•nonplayercaesar•1h ago•0 comments

From Side Project to 10k Monthly Users: My Lessons from Building a Dev Tool Solo

https://www.indiehackers.com/post/from-side-project-to-10-000-monthly-users-my-lessons-from-building-a-dev-tool-solo-NjmjHV37XNY9kJ2ckKg0
2•anil75•1h ago•0 comments

Scoop: Trump admin cuts contracts with scientific publishing giant

https://www.axios.com/2025/06/25/trump-cuts-contracts-scientific-publisher
1•01-_-•1h ago•0 comments

AIVocal-AI Podcast

https://aivocal.io/ai-podcast
1•18272837023•1h ago•0 comments

Book Review: Developing Talent in Young People by Benjamin Bloom

https://www.justinmath.com/book-review-bloom-developing-talent-in-young-people/
2•rzk•1h ago•1 comments

The 90% Gravity Problem: Why We Tend to Quit Right Before the Finish Line

5•darwinSir•1h ago•4 comments

Show HN: Daf·thunk – open-source Editor for Prototyping Workflows on Cloudflare

https://www.dafthunk.com/
6•bchapuis•1h ago•1 comments

Speeding up global DNS resolution by avoiding CNAMES

https://thomas-leister.de/en/accelerating-global-dns-cnames/
1•stoerfall•1h ago•2 comments