In my recent post on GPU engineering for LLMs, I argue that obsessing over CUDA kernel engineering isn’t the best starting point—understanding the full system stack, from model definition to hardware limits, is far more critical.
System-level thinking helps you spot whether you're compute-bound, memory-bound, or communication-bound before diving into low-level optimizations.
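As a concrete illustration of that kind of check, here's a minimal roofline-style sketch: compute a matmul's arithmetic intensity and compare it to the hardware's ridge point. The peak FLOPs and bandwidth numbers are rough A100-class assumptions, not measurements; swap in your own GPU's specs.

```python
# Rough roofline check: is a matmul compute- or memory-bound on this GPU?
# Hardware numbers are illustrative (roughly A100-class); substitute your own.

PEAK_FLOPS = 312e12            # assumed peak FP16 tensor-core throughput, FLOP/s
PEAK_BW = 2.0e12               # assumed peak HBM bandwidth, bytes/s
RIDGE = PEAK_FLOPS / PEAK_BW   # FLOP/byte needed to saturate compute

def matmul_intensity(m: int, n: int, k: int, bytes_per_elem: int = 2) -> float:
    """Arithmetic intensity of an (m,k) @ (k,n) matmul, in FLOP/byte."""
    flops = 2 * m * n * k                                  # multiply + add per MAC
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)  # read A, B; write C
    return flops / bytes_moved

# Decode-time GEMV (batch 1, hidden dim 8192): intensity ~1 FLOP/byte,
# far below the ridge point, so it's heavily memory-bound.
ai = matmul_intensity(1, 8192, 8192)
kind = "compute" if ai > RIDGE else "memory"
print(f"intensity={ai:.1f} FLOP/B, ridge={RIDGE:.0f} FLOP/B -> {kind}-bound")
```

Running that for a batch-1 decode step versus a large prefill matmul makes it obvious why the two phases need different optimization strategies.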
But I’m curious: for engineers breaking into inference engineering, where do you recommend starting? Should newcomers focus on mastering profiling tools and frameworks like PyTorch or JAX first, or jump headfirst into distributed systems right away?
Also, I downplay kernel engineering in the post, but are there any specific scenarios where hand-tuned kernels have been a game-changer for you?