frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The "setup tax" on AWS H100s is killing iterative research

3•miyamotomusashi•2h ago
I've been benchmarking the cost economics of fine tuning 70B parameter models on AWS H100 instances versus distributed consumer hardware (RTX 4090s over WAN).

The common assumption is that consumer swarms are too slow due to latency. But my modeling suggests we are ignoring the "setup tax" of the cloud.

The Data:

- Cloud (AWS): For short, iterative runs (1-2 hours), you pay for nearly 45 minutes of dead time per session just setting up environments and downloading 140GB+ weights.

- Swarm (WAN): While inference/training speed is slower (1.6x wall clock time due to network latency), the environment is persistent.

The Trade off: The math shows that for iterative research, the swarm architecture becomes ~ 57% cheaper overall, even accounting for the slower speed. You are trading latency to bypass the startup overhead and the VRAM wall.

I'm trying to validate if this trade off makes sense for real world workflows. For those finetuning 70B+ models: Is time your #1 bottleneck, or would you accept a 1.6x slowdown to cut compute costs by half ?

Built a local-first crypto P&L and TurboTax Online export tool (open source)

1•metalusmonk•51s ago•0 comments

Trump Signs Defense Bill Prohibiting China-Based Engineers in Pentagon IT Work

https://www.propublica.org/article/trump-law-microsoft-digital-escort-ban-china
1•_____k•1m ago•0 comments

What I learned building an opinionated and minimal coding agent

https://mariozechner.at/posts/2025-11-30-pi-coding-agent/
1•PaulHoule•1m ago•0 comments

The Compiler Is Your Best Friend, Stop Lying to It

https://blog.daniel-beskin.com/2025-12-22-the-compiler-is-your-best-friend-stop-lying-to-it
1•based2•2m ago•0 comments

Playing to Lose

https://powering-the-planet.ghost.io/playing-to-lose/
1•DamonHD•2m ago•0 comments

The Cost of a Closure in C, the Rest

https://thephd.dev/the-cost-of-a-closure-in-c-c2y-followup
1•gsky•3m ago•0 comments

Autonomous Medical Officer Support Software on the ISS (2024)

https://ntrs.nasa.gov/citations/20240012964
1•StatsAreFun•4m ago•0 comments

PS5 ROM Keys

https://www.psdevwiki.com/ps5/Keys#PS5_ROM_Keys
2•m00dy•4m ago•0 comments

What Is Apache Spark: Complete 2026 Guide to AI-Native Big Data Processing

https://www.netcomlearning.com/blog/apache-spark
1•based2•5m ago•0 comments

The HSBC app refuses to work if "Bitwarden" is installed on user's Android phone

https://twitter.com/nixcraft/status/2006133658495656377
1•fortran77•6m ago•1 comments

The State of LLMs 2025: Progress, Problems, and Predictions

https://magazine.sebastianraschka.com/p/state-of-llms-2025
2•ModelForge•8m ago•0 comments

Stardew Valley developer made a $125k donation to the FOSS C# framework MonoGame

https://monogame.net/blog/2025-12-30-385-new-sponsor-announcement/
5•haunter•9m ago•0 comments

Online interpreter of SAKO, a Polish 1959 programming language

https://sako-zam41.netlify.app
1•nathell•11m ago•1 comments

How I Built a Full-Stack SaaS Platform in 150 Commits (and 22 Days) with AI

https://medium.com/@smccaffrey70/how-i-built-a-full-stack-saas-platform-in-150-commits-and-22-day...
1•smccaffrey•12m ago•0 comments

By how much does your memory allocator overallocate?

https://lemire.me/blog/2025/12/30/by-how-much-does-your-memory-allocator-overallocates/
1•gsky•12m ago•0 comments

What I've done with all the extra time gained from learning the Ian Knot

https://blog.klungo.no/2025/12/31/two-years-of-the-ian-knot/
1•danielskogly•15m ago•0 comments

Notes on Building Agentic Tools Using Local LLMs

https://rdrn.dev/ai-agents/
1•sails•17m ago•1 comments

Show HN: OpenCode plugin for interactive plan annotation

https://github.com/backnotprop/plannotator/tree/main/apps/opencode-plugin
2•ramoz•18m ago•0 comments

The new bootc kickstart command in Anaconda

https://fedoramagazine.org/introducing-the-new-bootc-kickstart-command-in-anaconda/
2•coldsunrays•18m ago•0 comments

Show HN: Multimodal Search over the National Gallery of Art

https://mxp.co/r/nga
1•Beefin•20m ago•0 comments

Reusable agents meet agentic UI: AG-UI integration for Open Agent Specification

https://blogs.oracle.com/ai-and-datascience/announcing-ag-ui-integration-for-agent-spec
1•nathan_tarbert•20m ago•0 comments

The Year in Search

https://glitchads.ai/the-year-in-search/
1•vinnyglennon•22m ago•0 comments

Altered brain tissue microstructure in long Covid&recovered Covid-19 individuals

https://www.sciencedirect.com/science/article/pii/S2666354625002005
3•bookofjoe•22m ago•2 comments

Harvard Principles and Practices of Engineering Artificially Intelligent Systems

https://github.com/harvard-edge/cs249r_book
3•Terretta•23m ago•1 comments

Billionaire Superyacht Showdown: Who's in St. Barts for New Year's Eve

https://www.forbes.com/sites/jimdobson/2025/12/16/inside-the-billionaire-superyacht-rush-whos-cru...
1•DustinEchoes•24m ago•0 comments

AI in 2026

https://gusarich.com/blog/ai-in-2026/
1•Gusarich•24m ago•0 comments

The importance of free software to science

https://lwn.net/Articles/1023299/
1•leephillips•24m ago•0 comments

Xi Jinping Vows to Reunify China and Taiwan in New Year's Eve Speech

https://www.theguardian.com/world/2025/dec/31/xi-jinping-vows-reunification-china-taiwan-new-year...
13•belter•26m ago•2 comments

Show HN: 50% cheaper crypto data. Now in Python

https://github.com/qoery-com/qoery-py
1•SamTinnerholm•26m ago•0 comments

Show HN: Pixel Kit – Official Release

https://pixel-kit.vercel.app/login
1•ivanglpz•27m ago•0 comments