Show HN: Physics-based simulator for distributed LLM training and inference

https://simulator.zhebrak.io
1•zhebrak•1h ago
I built an analytical simulator that estimates MFU, training time, memory, throughput, and cost for distributed LLM training and inference. 70+ models, 25 GPUs, all major parallelism strategies (FSDP, TP, PP, EP, CP, ZeRO). Runs entirely client-side — no backend, no data collection.
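For readers unfamiliar with MFU (model FLOPs utilisation), here is a minimal sketch of how it is commonly estimated for training. It assumes the widely used approximation of about 6 FLOPs per parameter per token for a forward plus backward pass, which ignores attention FLOPs; for MoE models the active parameter count would be used. The function name and all input numbers are hypothetical and chosen only for illustration; this is not taken from the simulator's code.

```python
def training_mfu(params, tokens_per_second, num_gpus, peak_flops_per_gpu):
    """Estimate training MFU as achieved model FLOP/s over aggregate peak FLOP/s.

    Assumes ~6 FLOPs per parameter per token (forward + backward),
    a common rule of thumb that ignores attention FLOPs.
    """
    achieved_flops_per_second = 6.0 * params * tokens_per_second
    peak_flops_per_second = num_gpus * peak_flops_per_gpu
    return achieved_flops_per_second / peak_flops_per_second


# Hypothetical inputs, not measurements: a 7e9-parameter dense model
# processing 80,000 tokens/s on 8 GPUs with 1e15 FLOP/s peak each.
mfu = training_mfu(
    params=7e9,
    tokens_per_second=80_000,
    num_gpus=8,
    peak_flops_per_gpu=1e15,
)
print(f"MFU: {mfu:.1%}")
```

An analytical simulator would have to predict the throughput term from compute, memory, and communication costs rather than take it as an input, which is presumably where most of the modelling effort goes.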

Comments

zhebrak•1h ago
Built for sweeping strategies, sanity-checking cluster budgets, and building intuition for parallelism tradeoffs; it is not a substitute for profiling production workloads. Calibrated against published runs from Meta, DeepSeek, and NVIDIA, with simulated MFU within 1-2 percentage points of the published figures:

- LLaMA 3.1 405B (16K H100): 41.1% sim vs ~40% published

- DeepSeek V3 (2048 H800): 44.7% sim vs 43.7% published

- Nemotron-4 340B (6144 H100): 41.2% sim vs 41-42% published
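The gaps in the list above can be checked with a few lines of arithmetic. This is only a sketch: it treats "~40%" as exactly 40.0 and "41-42%" as a range, both of which are assumptions about how to read the published figures, and the helper name `gap_pp` is made up for illustration.

```python
# (name, simulated MFU %, published MFU % as a (low, high) range)
# "~40%" is read as exactly 40.0, which is an assumption.
runs = [
    ("LLaMA 3.1 405B", 41.1, (40.0, 40.0)),
    ("DeepSeek V3", 44.7, (43.7, 43.7)),
    ("Nemotron-4 340B", 41.2, (41.0, 42.0)),
]


def gap_pp(sim, published):
    """Distance in percentage points from sim to the published range."""
    low, high = published
    if sim < low:
        return low - sim
    if sim > high:
        return sim - high
    return 0.0  # simulated value falls inside the published range


gaps = {name: round(gap_pp(sim, pub), 1) for name, sim, pub in runs}
for name, gap in gaps.items():
    print(f"{name}: {gap} pp")
```

Under those assumptions the largest gap is 1.1 percentage points, consistent with the stated 1-2 point calibration.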

Important caveat: the model captures physics (compute, memory bandwidth, communication) but not runtime optimisations and fused kernels, so inference is the weaker side.
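To illustrate what "physics" means here, a roofline-style estimate is one common way to bound a step's time by compute and memory bandwidth. The sketch below is a generic textbook version, not the simulator's actual model; the function name and every number in the example are hypothetical, and it leaves out communication as well as the kernel-level effects mentioned in the caveat.

```python
def roofline_time(flops, bytes_moved, peak_flops, mem_bandwidth):
    """Lower-bound step time (seconds) and which resource limits it.

    Assumes compute and memory traffic overlap perfectly, so the
    slower of the two determines the step time.
    """
    compute_time = flops / peak_flops
    memory_time = bytes_moved / mem_bandwidth
    if compute_time >= memory_time:
        return compute_time, "compute"
    return memory_time, "memory"


# Hypothetical workload: 2e12 FLOPs and 1e12 bytes of memory traffic
# on a device with 1e15 FLOP/s peak and 2e12 B/s memory bandwidth.
step_time, bound = roofline_time(
    flops=2e12,
    bytes_moved=1e12,
    peak_flops=1e15,
    mem_bandwidth=2e12,
)
print(f"{step_time:.3f} s, {bound}-bound")
```

In this made-up case the step is memory-bound, which is the typical regime for small-batch inference decoding and one reason fused kernels and runtime optimisations matter more there.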

Configs to try:

- LLaMA-3.1 (405B) on 16,384x NVIDIA H100 — https://simulator.zhebrak.io/?preset=llama3-405b

- Qwen3 MoE (235B) on 4x NVIDIA H200 SXM — https://simulator.zhebrak.io/?preset=qwen3-235b-inference

GitHub with benchmarks and examples: https://github.com/zhebrak/llm-cluster-simulator

If you have published training runs with MFU or throughput numbers, I'd love to hear from you to expand calibration.

Show HN: Stupid simple e-ink RSS reader

https://github.com/edleeman17/E-Ink-RSS-Reader
1•ed1727•15s ago•0 comments

Dev jobs up 10% YoY while other jobs down 5.8%. What do you see on the ground?

1•mrborgen•20s ago•0 comments

The DIY OpenClaw Assistant You'll Want to Carry

https://www.hackster.io/news/the-diy-openclaw-assistant-you-ll-actually-want-to-carry-97888b183ac1
1•toomuchtodo•48s ago•0 comments

The Secret History of Knocking on Wood: Most of human nature is not written down

https://resobscura.substack.com/p/neolithic-habits-machine-age-tools
1•benbreen•1m ago•0 comments

Show HN: Tiny-parquet – JavaScript lib to read/write Parquet in 326KB of WASM

https://github.com/nktrchk/tiny-parquet
2•Nikitaita•3m ago•0 comments

Is it just me or is reviewing PRs getting exponentially harder?

https://www.bitarch.dev/blog/the-hidden-cost-of-ai-assisted-coding
1•birdculture•3m ago•0 comments

LLM-as-a-Judge: Evaluating Output Without a Ground Truth

https://www.kerno.io/blog/llm-as-a-judge-evaluating-output-without-a-ground-truth
1•karimtr•3m ago•0 comments

Private Markets Hiring Defies Gloom with $2.5M Pay Deals

https://www.bloomberg.com/news/articles/2026-02-24/private-markets-hiring-defies-gloom-with-2-5-m...
1•petethomas•3m ago•0 comments

A Meta AI security researcher said an OpenClaw agent ran amok on her inbox

https://techcrunch.com/2026/02/23/a-meta-ai-security-researcher-said-an-openclaw-agent-ran-amok-o...
1•tobr•4m ago•0 comments

Show HN: Microgpt-ts – Full GPT in 500 lines of TypeScript, zero dependencies

https://microgpt-ts.vercel.app/
1•sdubois•4m ago•0 comments

Show HN: Maude the Unicorn Slayer – Disrupting Series-B startups in 16-bit

https://maudetheunicornslayer.com/
1•artlessbfa•4m ago•1 comment

Show HN: Scry – Test migrations against production scale copy of your DB

https://www.scrydata.com/blog/the-postmortem-we-never-wrote/
1•gmcquillan•5m ago•1 comment

Show HN: Tabularis – DB GUI where drivers are JSON-RPC executables

1•debba•7m ago•0 comments

Tesla's Europe problem keeps getting worse

https://www.cnbc.com/2026/02/24/tesla-car-sales-elon-musk-europe-autos-trump-evs.html
1•Betelbuddy•7m ago•0 comments

Claude AI Agents Built a C Compiler: What It Means for the Future of AI Coding

https://manojgopanapalli.substack.com/p/sixteen-claude-ai-agents-built-a
1•thecontentboy•7m ago•0 comments

Show HN: Harp – Offline, Org-Mode Based Personal Health Records Application

https://docs.lepisma.xyz/harp/
1•lepisma•8m ago•0 comments

Show HN: I analyze your GitHub code and generate a developer personality card

https://howyoucode.dev
1•marcelglaeser•9m ago•1 comment

The long tail of niche AI apps on demand

https://www.djmurphy.net/blog/apps-on-demand/
2•sollewitt•9m ago•0 comments

Show HN: AI-Nexus – Unified Rule Manager for Claude Code, Cursor, and Codex

https://github.com/JSK9999/ai-nexus
1•suntrix3•10m ago•1 comment

Conduit AI – AI voice agent that answers missed calls for service businesses

https://www.conduitai.io/
1•wpbluiss•10m ago•1 comment

IBM posts steepest daily drop since 2000

https://www.reuters.com/business/ibm-posts-steepest-daily-drop-since-2000-after-anthropic-says-ai...
1•saikatsg•11m ago•1 comment

China Bet Billions on Agentic AI as Commerce Becomes the New Battleground

https://manojgopanapalli.substack.com/p/chinas-hyperscalers-bet-billions
1•thecontentboy•11m ago•0 comments

The Flight – Niklaus Wirth as co-pilot

http://coraid.com/b180724-the-flight.html
1•rbanffy•12m ago•0 comments

Show HN: Waggle – A search engine for A2A protocol agents

https://waggle.zone
1•enmerk4r•12m ago•0 comments

Show HN: Jsonchunk – Parse incomplete JSON from streaming LLM responses

https://github.com/jbingen/jsonchunk
1•jbingen•13m ago•0 comments

Meta is planning stablecoin comeback in the second half of this year

https://www.coindesk.com/business/2026/02/24/mark-zuckerberg-s-meta-is-planning-stablecoin-comeba...
2•mfiguiere•13m ago•0 comments

Show HN: Omni – Open-source workplace search and chat, built on Postgres

https://github.com/getomnico/omni
1•prvnsmpth•13m ago•0 comments

ProducerAI: Music creation partner, now in Google Labs

https://blog.google/innovation-and-ai/models-and-research/google-labs/producerai/
1•doppp•14m ago•0 comments

We audited both MCP SDKs – three classes of boundary-crossing vulnerabilities

1•manuelnd•15m ago•0 comments

My perfect Music app doesn't exist

https://hicks.design/journal/my-perfect-music-app-doesnt-exist
2•dewey•15m ago•0 comments