frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Engineering Perception with Combinatorial Memetics

1•alan_sass•1m ago•1 comments

Show HN: Steam Daily – A Wordle-like daily puzzle game for Steam fans

https://steamdaily.xyz
1•itshellboy•3m ago•0 comments

The Anthropic Hive Mind

https://steve-yegge.medium.com/the-anthropic-hive-mind-d01f768f3d7b
1•spenvo•3m ago•0 comments

Just Started Using AmpCode

https://intelligenttools.co/blog/ampcode-multi-agent-production
1•BojanTomic•4m ago•0 comments

LLM as an Engineer vs. a Founder?

1•dm03514•5m ago•0 comments

Crosstalk inside cells helps pathogens evade drugs, study finds

https://phys.org/news/2026-01-crosstalk-cells-pathogens-evade-drugs.html
2•PaulHoule•6m ago•0 comments

Show HN: Design system generator (mood to CSS in <1 second)

https://huesly.app
1•egeuysall•6m ago•1 comments

Show HN: 26/02/26 – 5 songs in a day

https://playingwith.variousbits.net/saturday
1•dmje•7m ago•0 comments

Toroidal Logit Bias – Reduce LLM hallucinations 40% with no fine-tuning

https://github.com/Paraxiom/topological-coherence
1•slye514•9m ago•1 comments

Top AI models fail at >96% of tasks

https://www.zdnet.com/article/ai-failed-test-on-remote-freelance-jobs/
3•codexon•9m ago•1 comments

The Science of the Perfect Second (2023)

https://harpers.org/archive/2023/04/the-science-of-the-perfect-second/
1•NaOH•10m ago•0 comments

Bob Beck (OpenBSD) on why vi should stay vi (2006)

https://marc.info/?l=openbsd-misc&m=115820462402673&w=2
2•birdculture•14m ago•0 comments

Show HN: a glimpse into the future of eye tracking for multi-agent use

https://github.com/dchrty/glimpsh
1•dochrty•15m ago•0 comments

The Optima-l Situation: A deep dive into the classic humanist sans-serif

https://micahblachman.beehiiv.com/p/the-optima-l-situation
2•subdomain•15m ago•0 comments

Barn Owls Know When to Wait

https://blog.typeobject.com/posts/2026-barn-owls-know-when-to-wait/
1•fintler•15m ago•0 comments

Implementing TCP Echo Server in Rust [video]

https://www.youtube.com/watch?v=qjOBZ_Xzuio
1•sheerluck•16m ago•0 comments

LicGen – Offline License Generator (CLI and Web UI)

1•tejavvo•19m ago•0 comments

Service Degradation in West US Region

https://azure.status.microsoft/en-gb/status?gsid=5616bb85-f380-4a04-85ed-95674eec3d87&utm_source=...
2•_____k•19m ago•0 comments

The Janitor on Mars

https://www.newyorker.com/magazine/1998/10/26/the-janitor-on-mars
1•evo_9•21m ago•0 comments

Bringing Polars to .NET

https://github.com/ErrorLSC/Polars.NET
3•CurtHagenlocher•23m ago•0 comments

Adventures in Guix Packaging

https://nemin.hu/guix-packaging.html
1•todsacerdoti•24m ago•0 comments

Show HN: We had 20 Claude terminals open, so we built Orcha

1•buildingwdavid•24m ago•0 comments

Your Best Thinking Is Wasted on the Wrong Decisions

https://www.iankduncan.com/engineering/2026-02-07-your-best-thinking-is-wasted-on-the-wrong-decis...
1•iand675•24m ago•0 comments

Warcraftcn/UI – UI component library inspired by classic Warcraft III aesthetics

https://www.warcraftcn.com/
1•vyrotek•25m ago•0 comments

Trump Vodka Becomes Available for Pre-Orders

https://www.forbes.com/sites/kirkogunrinde/2025/12/01/trump-vodka-becomes-available-for-pre-order...
1•stopbulying•26m ago•0 comments

Velocity of Money

https://en.wikipedia.org/wiki/Velocity_of_money
1•gurjeet•29m ago•0 comments

Stop building automations. Start running your business

https://www.fluxtopus.com/automate-your-business
1•valboa•33m ago•1 comments

You can't QA your way to the frontier

https://www.scorecard.io/blog/you-cant-qa-your-way-to-the-frontier
1•gk1•34m ago•0 comments

Show HN: PalettePoint – AI color palette generator from text or images

https://palettepoint.com
1•latentio•35m ago•0 comments

Robust and Interactable World Models in Computer Vision [video]

https://www.youtube.com/watch?v=9B4kkaGOozA
2•Anon84•39m ago•0 comments
Open in hackernews

Show HN: Checkpoint K8s pods transparently (plain CPU or GPU accelerated) [video]

https://www.youtube.com/watch?v=K9yY6_2255Y
2•qnib•7mo ago
Hi HN,

I am an early adopter of containers with a background in HPC. From the early days, I’ve tried to merge container tech into the HPC (and HTC) stack. Containers already make packing and deployment easier - especially in AI/ML and data science. How about checkpoint/restore?

Over the last couple of months, we at MemVerge have developed a Kubernetes Operator for transparent checkpointing and restoring, allowing you to use discounted Spot instances for long-running workloads, like bioinformatics workflows or ML training.

Here’s how it works: - the operator attaches a PVC to your pod - intercepts the STOP signal to checkpoint the pod - if the attached PVC contains a checkpoint when the pod is starting over, it will be restored instead of starting from scratch.

Here’s a 2m30s video that demonstrates interrupting a small training workload: https://youtu.be/K9yY6_2255Y

This can be triggered by someone draining the node (e.g., due to an EC2 Spot reclaim), deleting a pod, or another operator acting on its own logic. Our checkpoint engine captures every aspect of the process tree within the container: memory pages, file descriptors—even TCP connections, if you want us to. Until recently, it was targeted at CPU use cases only. We’ve now added support for NVIDIA GPUs, with AMD GPUs coming soon (via upstream CRIU plugins).

I’ve done some typical checkpoint/restore work (e.g., Jupyter notebooks, traditional jobs) and would love to hear what kinds of workloads you’re interested in checkpointing and restoring.

You can try it out in your Kubernetes environment with our 60-day trial: https://form.typeform.com/to/vZujMYxI

Comments

pveldandi•7mo ago
Really interesting work. we’ve been building a container-native snapshotting system too, but focused on cold start reduction and multi-model orchestration for LLM inference.

Different use case (sub-2s loading for large models), but very similar challenges around memory, device state, and restore reliability.

qnib•7mo ago
Radostin from RedHat did some interesting work on hot-swapping (kinda cold-start) of models: https://www.researchgate.net/publication/392759373_Transpare... Definitely something that might be added in the future.
jwcesign•7mo ago
So, this solution is built on CRIU?
qnib•7mo ago
essentially yes. The missing piece is experience in making sure the right files, descriptors and memory blobs are saved and brought back in the right order and the coordination with different schedulers (like K8s in this case, but also AWS Batch, HTCondor, SLURM, ...).

EC2 Spot only gives you 2min in which you need to wrap up everything, GCP even only 30s. The time pressure is also an obstacle to deal with.