frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

The eighth-generation TPU: An architecture deep dive

https://cloud.google.com/blog/products/compute/tpu-8t-and-tpu-8i-technical-deep-dive
67•meetpateltech•3h ago

Comments

zshn25•2h ago
Splitting TPUs into dedicated training vs inference chips feels like an admission that the bottleneck has shifted from FLOPs to memory bandwidth + latency. Are future gains to come more from memory/system design than raw compute scaling? What’s that saying about Scaling laws?
xnx•2h ago
> Splitting TPUs into dedicated training vs inference chips feels like an admission that the bottleneck has shifted from FLOPs to memory bandwidth + latency.

With the expected scale of inference, it makes cost sense to make dedicated hardware for each task if the workloads are even slightly different. Probably similar to the video decoding chips in TVs not being very cheap/efficient compared to chips capable of encoding video.

sdenton4•1h ago
I think the first two paragraphs of the post are exactly saying that the bottleneck is memory... Long contexts, bigger but less flop-intensive models (moe's).

The funny thing about scaling laws is that as soon as they were known, the whole objective became learning how to break them - bending the curve, at least. They provided an incredibly useful target, but 'law' was a bit too strong a word.

mathisfun123•1h ago
> admission that the bottleneck has shifted

There's no admission - this has always been known.

ricardo81•1h ago
dupe https://news.ycombinator.com/item?id=47862497
QuantumNomad_•1h ago
They are different blog posts, written by different people at Google
ttul•1h ago
No matter how smart your large language model is, if you can’t find the energy to power it, it won’t run. I could imagine Google winning merely because their chips are more efficient. Of course, the other labs are capable of making chips, but Google has been doing it for years.
speedping•1h ago
2.764 petabytes of HBM per 8i? So that's where all the RAM went.
londons_explore•51m ago
288 TB/pod (1024 chips).
juancn•32m ago
Super interesting but it's so damn hard to find any detail.

I would love to see an instruction set reference for one of these, all you have is hardware architectural diagrams or high level APIs.

Show HN submissions tripled and now mostly have the same vibe-coded look

https://www.adriankrebs.ch/blog/design-slop/
98•hubraumhugo•1h ago•56 comments

Windows 9x Subsystem for Linux

https://social.hails.org/@hailey/116446826733136456
530•sohkamyung•5h ago•136 comments

Our eighth generation TPUs: two chips for the agentic era

https://blog.google/innovation-and-ai/infrastructure-and-cloud/google-cloud/eighth-generation-tpu...
179•xnx•3h ago•102 comments

3.4M Solar Panels

https://tech.marksblogg.com/american-solar-farms-v2.html
167•marklit•3h ago•106 comments

Treetops glowing during storms captured on film for first time

https://www.psu.edu/news/earth-and-mineral-sciences/story/treetops-glowing-during-storms-captured...
78•t-3•2h ago•14 comments

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

https://qwen.ai/blog?id=qwen3.6-27b
80•mfiguiere•2h ago•45 comments

GitHub CLI now collects pseudoanonymous telemetry

https://cli.github.com/telemetry
219•ingve•3h ago•175 comments

Columnar Storage Is Normalization

https://buttondown.com/jaffray/archive/columnar-storage-is-normalization/
46•ibobev•3h ago•20 comments

Making RAM at Home [video]

https://www.youtube.com/watch?v=h6GWikWlAQA
499•kaipereira•1d ago•140 comments

ChatGPT Images 2.0

https://openai.com/index/introducing-chatgpt-images-2-0/
954•wahnfrieden•20h ago•833 comments

How does GPS work?

https://perthirtysix.com/how-the-heck-does-gps-work
135•alfanick•6h ago•30 comments

DuckDB 1.5.2 – SQL database that runs on laptop, server, in the browser

https://duckdb.org/2026/04/13/announcing-duckdb-152
38•janandonly•58m ago•4 comments

Kernel code removals driven by LLM-created security reports

https://lwn.net/Articles/1068928/
76•edward•3h ago•58 comments

XOR'ing a register with itself is the idiom for zeroing it out. Why not sub?

https://devblogs.microsoft.com/oldnewthing/20260421-00/?p=112247
124•ingve•9h ago•141 comments

Another Day Has Come

https://daringfireball.net/2026/04/another_day_has_come
87•ndr42•18h ago•80 comments

Monitor your Pi / OMP sessions

https://github.com/BlackBeltTechnology/pi-agent-dashboard
14•ankitg12•3d ago•1 comments

MuJoCo – Advanced Physics Simulation

https://github.com/google-deepmind/mujoco
78•modinfo•3d ago•15 comments

All your agents are going async

https://zknill.io/posts/all-your-agents-are-going-async/
99•zknill•2d ago•61 comments

Prefill-as-a-Service:KVCache of Next-Generation Models Could Go Cross-Datacenter

https://arxiv.org/abs/2604.15039
28•matt_d•3d ago•1 comments

Contact Lens Uses Microfluidics to Monitor and Treat Glaucoma

https://spectrum.ieee.org/smart-contact-lens-glaucoma-microfluidics
80•pseudolus•3d ago•2 comments

Expansion Artifacts

https://mattstromawn.com/writing/expansion-artifacts/
16•tobr•1d ago•1 comments

Drunk post: Things I've learned as a senior engineer (2021)

https://luminousmen.substack.com/p/drunk-post-things-ive-learned-as
235•zdw•15h ago•176 comments

Garbage Collection Without Unsafe Code

https://fitzgen.com/2024/02/06/safe-gc.html
86•foota•3d ago•32 comments

Windows Server 2025 Runs Better on ARM

https://jasoneckert.github.io/myblog/server-2025-arm64/
164•jasoneckert•3d ago•124 comments

The Vercel breach: OAuth attack exposes risk in platform environment variables

https://www.trendmicro.com/en_us/research/26/d/vercel-breach-oauth-supply-chain.html
350•queenelvis•22h ago•114 comments

CATL's new LFP battery can charge from 10 to 98% in less than 7 minutes

https://arstechnica.com/cars/2026/04/catls-new-lfp-battery-can-charge-from-10-to-98-in-less-than-...
99•PotatoNinja•4h ago•43 comments

Nobody Got Fired for Uber's $8M Ledger Mistake?

https://news.alvaroduran.com/p/nobody-got-fired-for-ubers-8-million
88•ohduran•4h ago•64 comments

SpaceX says it has agreement to acquire Cursor for $60B

https://twitter.com/spacex/status/2046713419978453374
743•dmarcos•17h ago•898 comments

Acetaminophen vs. ibuprofen

https://asteriskmag.com/issues/14/the-mystery-in-the-medicine-cabinet
581•nkurz•2d ago•373 comments

Meta to start capturing employee mouse movements, keystrokes for AI training

https://www.reuters.com/sustainability/boards-policy-regulation/meta-start-capturing-employee-mou...
717•dlx•22h ago•476 comments