TPU (Tensor Processing Unit) Deep Dive

https://henryhmko.github.io/posts/tpu/tpu.html

139•transpute•5h ago

Comments

jan_Sate•4h ago

I thought that it would be about 3D printer filament.

almostgotcaught•4h ago

> In essence, caches allow hardware to be flexible and adapt to a wide range of applications. This is a large reason why GPUs are very flexible hardware (note: compared to TPUs).

this is correct but mis-stated - it's not the caches themselves that cost energy but MMUs that automatically load/fetch/store to cache on "page faults". TPUs don't have MMUs and furthermore are a push architecture (as opposed to pull).

RossBencina•2h ago

Can you suggest a good reference for understanding which algorithms map well onto the regular grid systolic arrays used by TPUs? The fine article says dese matmul and convolution are good, but is there anything else? Eigendecomposition? SVD? matrix exponential? Solving Ax = b or AX = B? Cholesky?

WithinReason•2h ago

Anything that you can express as 128x128 (but ideally much larger) dense matrix multiplication and nothing else

musebox35•1h ago

I think https://jax-ml.github.io/scaling-book/ is one of the best references to go through. It details how single device and distributed computations map to TPU hardware features. The emphasis is on mapping the transformer computations, both forwards and backwards, so requires some familiarity with how transformer networks are structured.

serf•1h ago

does that cooling channel have a NEMA stepper on it as a pump or metering valve?[0]

If so, wild. That seems like overkill.

[0]: https://henryhmko.github.io/posts/tpu/images/tpu_tray.png

fellowmartian•13m ago

definitely closed-loop, might even be a servo

frays•56m ago

How can someone have this level of knowledge about TPUs without working at Google?

musebox35•24m ago

From the acknowledgment at the end, I guess the author has access to TPUs through https://sites.research.google/trc/about/

This is not the only way though. TPUs are available to companies operating on GCP as an alternative to GPUs with a different price/performance point. That is another way to get hands-on experience with TPUs.

ipsum2•21m ago

Everything thats in the blog post is basically well known already. Google publishes papers and gives talks about their TPUs. Many details are lacking though, and require some assumptions/best guesses. Jax and XLA are (partially) open source and give clues about how TPUs work under the hood as well.

https://arxiv.org/abs/2304.01433

https://jax-ml.github.io/scaling-book/

ariwilson•23m ago

Cool article!

TPU (Tensor Processing Unit) Deep Dive

Sound As Pure Form: Music Language Inspired by Supercollider, APL, and Forth

Show HN: Progressor – coach that breaks down big goals into actionable steps

Remote MCP Support in Claude Code

P-Hacking in Startups

LaborBerlin: State-of-the-Art 16mm Projector

Announcing the Clippy feature freeze

Finally, a Makefile formatter (50 years overdue)

The bad boy of bar charts: William Playfair (2023)

Type Inference Zoo

Denmark's Archaeology Experiment Is Paying Off in Gold and Knowledge

U.S. bombs Iranian nuclear sites

Airpass – Easily overcome WiFi time limits

When Humans Learned to Live Everywhere

Show HN: Luna Rail – treating night trains as a spatial optimization problem

Samsung embeds IronSource spyware app on phones across WANA

P2piano: A P2P collaboration space for the musically inclined

AllTracker: Efficient Dense Point Tracking at High Resolution

Phoenix.new – Remote AI Runtime for Phoenix

Delta Chat is a decentralized and secure messenger app

uBlock Origin Lite Beta for Safari iOS

Scaling our observability platform by embracing wide events and replacing OTel

Tell HN: Beware confidentiality agreements that act as lifetime non competes

Using Microsoft's New CLI Text Editor on Ubuntu

Compact Representations for Arrays in Lua [pdf]

Compiler for the B Programming Language

Unexpected security footguns in Go's parsers

Linux on the Behringer X32 [video]

ARIA, the UK's Bet to Build Scientific Revolutions

'Gwada negative': French scientists find new blood type in woman

TPU (Tensor Processing Unit) Deep Dive

Comments

TPU (Tensor Processing Unit) Deep Dive

Sound As Pure Form: Music Language Inspired by Supercollider, APL, and Forth

Show HN: Progressor – coach that breaks down big goals into actionable steps

Remote MCP Support in Claude Code

P-Hacking in Startups

LaborBerlin: State-of-the-Art 16mm Projector

Announcing the Clippy feature freeze

Finally, a Makefile formatter (50 years overdue)

The bad boy of bar charts: William Playfair (2023)

Type Inference Zoo

Denmark's Archaeology Experiment Is Paying Off in Gold and Knowledge

U.S. bombs Iranian nuclear sites

Airpass – Easily overcome WiFi time limits

When Humans Learned to Live Everywhere

Show HN: Luna Rail – treating night trains as a spatial optimization problem

Samsung embeds IronSource spyware app on phones across WANA

P2piano: A P2P collaboration space for the musically inclined

AllTracker: Efficient Dense Point Tracking at High Resolution

Phoenix.new – Remote AI Runtime for Phoenix

Delta Chat is a decentralized and secure messenger app

uBlock Origin Lite Beta for Safari iOS

Scaling our observability platform by embracing wide events and replacing OTel

Tell HN: Beware confidentiality agreements that act as lifetime non competes

Using Microsoft's New CLI Text Editor on Ubuntu

Compact Representations for Arrays in Lua [pdf]

Compiler for the B Programming Language

Unexpected security footguns in Go's parsers

Linux on the Behringer X32 [video]

ARIA, the UK's Bet to Build Scientific Revolutions

'Gwada negative': French scientists find new blood type in woman