frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I hate paying for GPUs while developing – this is how I solved it

https://adithyask.medium.com/write-deep-learning-code-locally-and-run-on-gpus-instantly-6f173104b334
1•Adithya-Kolavi•4mo ago
So let's face it, if you’re doing anything with deep learning, GPUs are essential.

They’re expensive, and setting up infrastructure is painful. Most of the time, your GPUs sit idle while you’re coding, yet you still pay for uptime when scripts fail on the first try.

I ran into this as a “GPU poor” researcher. Tasks like downloading datasets or transforming data don’t need a GPU, but traditional setups force you to use one. Cloud setups don’t help—VMs with GPUs require manual environment setup, CUDA installations, or Docker containers just to get started. Multi-GPU training adds more headaches: not all images support NCCL, so communication between nodes can fail

At my research lab[1], we run experiments across model training, synthetic data generation, and RL. We needed a setup that was flexible, reliable, easy to use and collaborate.

When I went looking for a solution that would let me write code locally and run it on GPUs instantly, without worrying about infrastructure, multi-node setups, or idle GPU time, I stumbled upon Modal [2], and after a year of using it, it’s been a game-changer: it increases our research throughput and productivity, saves a ton on GPU costs and infrastructure management, and allows us to ship really fast.

I’ve compiled everything we’ve learned into this blog + hands-on tutorial [3], with three examples showing different ways to use Modal: rapidly develop on GPUs, deploy at scale, and do it all without breaking a sweat over infrastructure.

Here’s what we cover in the blog: - Wrapping existing code to run on Modal’s serverless infrastructure. - Handling datasets on Modal with volumes for seamless access. - Writing training scripts using Unsloth and Axolotl for easy fine-tuning. - Serving models in a scalable, high-throughput way with vLLM.

By the end, you’ll know how to write and experiment locally and run on GPUs instantly—no idle bills, no complex environment setup, no multi-node headaches.

[1] https://cognitivelab.in [2] https://modal.com [3] https://aiengineering.academy/LLM/ServerLessFinetuning/

Why there is no official statement from Substack about the data leak

https://techcrunch.com/2026/02/05/substack-confirms-data-breach-affecting-email-addresses-and-pho...
2•witnessme•3m ago•1 comments

Effects of Zepbound on Stool Quality

https://twitter.com/ScottHickle/status/2020150085296775300
1•aloukissas•6m ago•0 comments

Show HN: Seedance 2.0 – The Most Powerful AI Video Generator

https://seedance.ai/
1•bigbromaker•9m ago•0 comments

Ask HN: Do we need "metadata in source code" syntax that LLMs will never delete?

1•andrewstuart•15m ago•1 comments

Pentagon cutting ties w/ "woke" Harvard, ending military training & fellowships

https://www.cbsnews.com/news/pentagon-says-its-cutting-ties-with-woke-harvard-discontinuing-milit...
3•alephnerd•18m ago•1 comments

Can Quantum-Mechanical Description of Physical Reality Be Considered Complete? [pdf]

https://cds.cern.ch/record/405662/files/PhysRev.47.777.pdf
1•northlondoner•18m ago•1 comments

Kessler Syndrome Has Started [video]

https://www.tiktok.com/@cjtrowbridge/video/7602634355160206623
1•pbradv•21m ago•0 comments

Complex Heterodynes Explained

https://tomverbeure.github.io/2026/02/07/Complex-Heterodyne.html
3•hasheddan•21m ago•0 comments

EVs Are a Failed Experiment

https://spectator.org/evs-are-a-failed-experiment/
2•ArtemZ•33m ago•4 comments

MemAlign: Building Better LLM Judges from Human Feedback with Scalable Memory

https://www.databricks.com/blog/memalign-building-better-llm-judges-human-feedback-scalable-memory
1•superchink•34m ago•0 comments

CCC (Claude's C Compiler) on Compiler Explorer

https://godbolt.org/z/asjc13sa6
2•LiamPowell•36m ago•0 comments

Homeland Security Spying on Reddit Users

https://www.kenklippenstein.com/p/homeland-security-spies-on-reddit
3•duxup•38m ago•0 comments

Actors with Tokio (2021)

https://ryhl.io/blog/actors-with-tokio/
1•vinhnx•40m ago•0 comments

Can graph neural networks for biology realistically run on edge devices?

https://doi.org/10.21203/rs.3.rs-8645211/v1
1•swapinvidya•52m ago•1 comments

Deeper into the shareing of one air conditioner for 2 rooms

1•ozzysnaps•54m ago•0 comments

Weatherman introduces fruit-based authentication system to combat deep fakes

https://www.youtube.com/watch?v=5HVbZwJ9gPE
3•savrajsingh•55m ago•0 comments

Why Embedded Models Must Hallucinate: A Boundary Theory (RCC)

http://www.effacermonexistence.com/rcc-hn-1-1
1•formerOpenAI•56m ago•2 comments

A Curated List of ML System Design Case Studies

https://github.com/Engineer1999/A-Curated-List-of-ML-System-Design-Case-Studies
3•tejonutella•1h ago•0 comments

Pony Alpha: New free 200K context model for coding, reasoning and roleplay

https://ponyalpha.pro
1•qzcanoe•1h ago•1 comments

Show HN: Tunbot – Discord bot for temporary Cloudflare tunnels behind CGNAT

https://github.com/Goofygiraffe06/tunbot
2•g1raffe•1h ago•0 comments

Open Problems in Mechanistic Interpretability

https://arxiv.org/abs/2501.16496
2•vinhnx•1h ago•0 comments

Bye Bye Humanity: The Potential AMOC Collapse

https://thatjoescott.com/2026/02/03/bye-bye-humanity-the-potential-amoc-collapse/
3•rolph•1h ago•0 comments

Dexter: Claude-Code-Style Agent for Financial Statements and Valuation

https://github.com/virattt/dexter
1•Lwrless•1h ago•0 comments

Digital Iris [video]

https://www.youtube.com/watch?v=Kg_2MAgS_pE
1•vermilingua•1h ago•0 comments

Essential CDN: The CDN that lets you do more than JavaScript

https://essentialcdn.fluidity.workers.dev/
1•telui•1h ago•1 comments

They Hijacked Our Tech [video]

https://www.youtube.com/watch?v=-nJM5HvnT5k
2•cedel2k1•1h ago•0 comments

Vouch

https://twitter.com/mitchellh/status/2020252149117313349
41•chwtutha•1h ago•7 comments

HRL Labs in Malibu laying off 1/3 of their workforce

https://www.dailynews.com/2026/02/06/hrl-labs-cuts-376-jobs-in-malibu-after-losing-government-work/
4•osnium123•1h ago•1 comments

Show HN: High-performance bidirectional list for React, React Native, and Vue

https://suhaotian.github.io/broad-infinite-list/
2•jeremy_su•1h ago•0 comments

Show HN: I built a Mac screen recorder Recap.Studio

https://recap.studio/
1•fx31xo•1h ago•1 comments