Which H100 Instance to Train Nanochat – Benchmarking PCIe, SXM, and NVL

https://bluenotebook.io/blog/h100-nanochat-training/
1•k2so•1h ago

Comments

k2so•1h ago
Author here. I wanted to train Nanochat d26 to GPT-2 level and had to pick between three H100 variants on Runpod.

SXM was the most expensive per hour but the cheapest to finish:

SXM: 702ms/step - ~$37 (using vast.ai)
PCIe: 1,412ms/step - ~$112 (runpod)
NVL: 2,032ms/step - ~$181 (runpod)
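A rough sketch of the arithmetic behind "most expensive per hour but cheapest to finish": for a fixed number of steps, total cost scales with step time multiplied by hourly price. The step times are the measured ones above. The function names, the step count, and the hourly price in the last lines are hypothetical placeholders, not figures from these runs.

```python
# Measured step times from the runs above, in milliseconds per step.
STEP_MS = {"SXM": 702, "PCIe": 1412, "NVL": 2032}


def cost_to_finish(ms_per_step: float, n_steps: int, usd_per_hour: float) -> float:
    """Total rental cost for n_steps training steps at a fixed step time."""
    hours = ms_per_step * n_steps / 1000 / 3600
    return hours * usd_per_hour


def slowdown_vs(baseline: str) -> dict[str, float]:
    """Step-time ratio of every variant relative to the baseline variant."""
    base = STEP_MS[baseline]
    return {name: ms / base for name, ms in STEP_MS.items()}


if __name__ == "__main__":
    for name, ratio in slowdown_vs("SXM").items():
        print(f"{name}: {ratio:.2f}x the SXM step time")
    # Hypothetical inputs: 10,000 steps at $20/hour for the whole node.
    print(f"${cost_to_finish(STEP_MS['SXM'], 10_000, 20.0):.2f}")
```

With the step count held fixed, a variant that is about 2x faster per step stays cheaper to finish as long as its hourly price is less than about 2x higher.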

My first SXM run hit 1,295ms/step, barely faster than PCIe. The Nsight OS runtime summary led me to suspect CPU starvation. I found a higher-vCPU instance on Vast.ai, which hit 700ms. The 128 vCPU SXM instance also hit ~700ms, so it wasn't CPU count.

Looking at the network topology on Runpod and vast.ai, the first instance had GPUs split 4+4 across two NUMA nodes. NCCL's data transfer uses NVSwitch and is unaffected, but the control threads run on CPU. Cross-socket latency on every pthread_cond_signal added up.
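One way to check for a 4+4 split before starting a run, assuming a Linux host that exposes PCI devices under /sys/bus/pci/devices. The function name and the sysfs_root parameter are illustrative and not taken from the linked scripts. The sketch only reads the standard vendor, class, and numa_node sysfs attributes.

```python
from collections import defaultdict
from pathlib import Path

NVIDIA_VENDOR = "0x10de"
# PCI class codes beginning with 0x03 are display controllers (VGA and 3D).
DISPLAY_CLASS_PREFIX = "0x03"


def gpus_by_numa_node(sysfs_root: str = "/sys/bus/pci/devices") -> dict[int, list[str]]:
    """Group NVIDIA GPU PCI addresses by the NUMA node Linux reports for them.

    A node of -1 means the kernel has no NUMA affinity recorded for the device.
    """
    groups: dict[int, list[str]] = defaultdict(list)
    for dev in sorted(Path(sysfs_root).iterdir()):
        try:
            vendor = (dev / "vendor").read_text().strip()
            pci_class = (dev / "class").read_text().strip()
            node = int((dev / "numa_node").read_text().strip())
        except (OSError, ValueError):
            continue  # Skip devices whose attributes are missing or unreadable.
        if vendor == NVIDIA_VENDOR and pci_class.startswith(DISPLAY_CLASS_PREFIX):
            groups[node].append(dev.name)
    return dict(groups)
```

On an 8-GPU box, a result with two keys holding four addresses each would correspond to the 4+4 layout described above, while a single key would mean all GPUs hang off one NUMA node.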

NVL was the most confusing result: NCCL kernel times were nearly identical to PCIe, but step times were 44% worse. Only 4 of 28 GPU pairs share NVLink on NVL; the rest fall back to PCIe. I don't have a full explanation for this yet.
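The "4 of 28" figure follows from 8 GPUs forming C(8, 2) = 28 pairs, of which only the bridged pairs get NVLink. A small sketch that counts this from a link matrix, using the labels `nvidia-smi topo -m` prints (NV# for NVLink). The matrix below is a hypothetical layout built for illustration, with assumed "NV12" and "SYS" labels; it is not the actual output from the NVL instance.

```python
from itertools import combinations
from math import comb


def count_nvlink_pairs(topo: list[list[str]]) -> tuple[int, int]:
    """Return (pairs connected by NVLink, total GPU pairs) for a link matrix.

    topo[i][j] holds the link label between GPU i and GPU j, in the style
    of `nvidia-smi topo -m`, where labels starting with "NV" mean NVLink.
    """
    n = len(topo)
    nvlink = sum(
        1 for i, j in combinations(range(n), 2) if topo[i][j].startswith("NV")
    )
    return nvlink, comb(n, 2)


def bridged_pairs_matrix(
    n_gpus: int = 8, nv: str = "NV12", other: str = "SYS"
) -> list[list[str]]:
    """Hypothetical layout: GPUs (0,1), (2,3), ... bridged, all others not."""
    return [
        ["X" if i == j else nv if i // 2 == j // 2 else other for j in range(n_gpus)]
        for i in range(n_gpus)
    ]


if __name__ == "__main__":
    linked, total = count_nvlink_pairs(bridged_pairs_matrix())
    print(f"{linked} of {total} GPU pairs share NVLink")
```

Any collective that spans all 8 GPUs has to cross the non-NVLink pairs, so the bridged links only help traffic that stays inside a pair.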

Profiling script: https://github.com/Nikhil-Kasukurthi/nanochat/blob/master/sc...

Script with startup checks to ensure instance is healthy: https://github.com/Nikhil-Kasukurthi/nanochat/blob/master/ru...

Happy to discuss, especially if anyone has ideas on the NVL anomaly.

CamperBob2•1h ago
I'd just get an RTX Pro 6000 Blackwell and call it a day. More VRAM. Somewhat less bandwidth but it's your bandwidth.
k2so•1h ago
Yeah, for single-GPU inference, the higher VRAM and FP4 support on the RTX 6000 should let it fit larger models than the H100.
