How many registers does an x86-64 CPU have? (2020)

https://blog.yossarian.net/2020/11/30/How-many-registers-does-an-x86-64-cpu-have

28•tosh•2h ago

Comments

sylware•2h ago

Don't forget x86_64 like ARM is IP-locked, RISC-V is not.

JonChesterfield•1h ago

Good post! Stuff I didn't know x64 has. Sadly doesn't answer the "how many registers are behind rax" question I was hoping for, I'd love to know how many outstanding writes one can have to the various architectural registers before the renaming machinery runs out and things stall. Not really for immediate application to life, just a missing part of my mental cost model for x64.

fuhsnn•1h ago

Intel's next gen will add 16 more general purpose registers. Can't wait for the benchmarks.

Joker_vD•1h ago

So every function call will need to spill even more call-clobbered registers to the stack!

Like, I get that leaf functions with truly huge computational cores are a thing that would benefit from more ISA-visible registers, but... don't we have GPUs for that now? And TPUs? NPUs? Whatever those things are called?

throwaway17_17•1h ago

Why does having more more registers lead to spilling? I would assume (probably) incorrectly, that more registers means less spill. Are you talking about calls inside other calls which cause the outer scope arguments to be preemptively spilled so the inner scope data can be pre placed in registers?

CamelCaseCondo•48m ago

op is probably referring to the push all/pop all approach.

Joker_vD•10m ago

No, I don't. I use a common "spill definitely reused call-invariant registers at the prologue, spill call-clobbered registers that need to survive a call at precisely the call site" approach, see the sibling comment for the arithmetic.

Joker_vD•12m ago

So, let's take a function with 40 alive temporaries at a point where it needs to call a helper function of, say, two arguments.

On a 16 register machine with 9 call-clobbered registers and 7 call-invariant ones (one of which is the stack pointer) we put 6 temporaries into call-invariant registers (so there are 6 spills in the prologue of this big function), another 9 into the call-clobbered registers; 2 of those 9 are the helper function's arguments, but 7 other temporaries have to be spilled to survive the call. And the rest 25 temporaries live on the stack in the first place.

If we instead take a machine with 31 registers, 19 being call-clobbered and 12 call-invariant ones (one of which is a stack pointer), we can put 11 temporaries into call-invariant registers (so there are 11 spills in the prologue of this big function), and another 19 into the call-clobbered registers; 2 of those 19 are the helper function's arguments, so 17 other temporaries have to be spilled to survive the call. And the rest of 10 temporaries live on the stack in the first place.

So, there seems to be more spilling/reloading whether you count pre-emptive spills or the on-demand-at-the-call-site spills, at least to me.

jandrewrogers•59m ago

Most function calls are aggressively inlined by the compiler such that they are no longer "function calls". More registers will make that even more effective.

BobbyTables2•27m ago

How are they adding GPRs? Won’t that utterly break how instructions are encoded?

That would be a major headache — even if current instruction encodings were somehow preserved.

It’s not just about compilers and assemblers. Every single system implementing virtualization has a software emulation of the instruction set - easily 10k lines of very dense code/tables.

Joker_vD•10m ago

The same way AMD added 8 new GPRs, I imagine: by introducing a new instruction prefix.

nefsim•1h ago

Even though this post is from 2020, it’s still a classic reference. It’s especially relevant now to revisit this baseline considering Intel’s APX which aims to double the GPRs to 32. Understanding how we got here is key to appreciating where the architecture is headed next.

Ooh.directory: a place to find good blogs that interest you

Show HN: Sameshi – a ~1200 Elo chess engine that fits within 2KB

Zig – io_uring and Grand Central Dispatch std.Io implementations landed

My smart sleep mask broadcasts users' brainwaves to an open MQTT broker

Shades of Halftone

Show HN: I spent 3 years reverse-engineering a 40 yo stock market sim from 1986

How many registers does an x86-64 CPU have? (2020)

Show HN: SQL-tap – Real-time SQL traffic viewer for PostgreSQL and MySQL

Ars Technica makes up quotes from Matplotlib maintainer; pulls story

Code Storage by the Pierre Computer Company

Babylon 5 is now free to watch on YouTube

What color are your bits? (2004)

Switzerland to Vote on Capping Population at 10M

Understanding the Go Compiler: The Linker

The Sling: Humanity's Forgotten Power

The World of Harmonics – With a Coffee, Guitar and Synth

The mathematics of compression in database systems

Show HN: Data Engineering Book – An open source, community-driven guide

How the Little Guy Moved

Cogram (YC W22) – Hiring former technical founders

Homeland Security has sent out subpoenas to identify ICE critics

Common Lisp Screenshots: today's CL applications in action

GPT-5.2 derives a new result in theoretical physics

Building a TUI is easy now

Epstein's Ugly World of Science

Font Rendering from First Principles

NPMX – a fast, modern browser for the NPM registry

YouTube as Storage

Backblaze Drive Stats for 2025

Monosketch

Ooh.directory: a place to find good blogs that interest you

Show HN: Sameshi – a ~1200 Elo chess engine that fits within 2KB

Zig – io_uring and Grand Central Dispatch std.Io implementations landed

My smart sleep mask broadcasts users' brainwaves to an open MQTT broker

Shades of Halftone

Show HN: I spent 3 years reverse-engineering a 40 yo stock market sim from 1986

How many registers does an x86-64 CPU have? (2020)

Show HN: SQL-tap – Real-time SQL traffic viewer for PostgreSQL and MySQL

Ars Technica makes up quotes from Matplotlib maintainer; pulls story

Code Storage by the Pierre Computer Company

Babylon 5 is now free to watch on YouTube

What color are your bits? (2004)

Switzerland to Vote on Capping Population at 10M

Understanding the Go Compiler: The Linker

The Sling: Humanity's Forgotten Power

The World of Harmonics – With a Coffee, Guitar and Synth

The mathematics of compression in database systems

Show HN: Data Engineering Book – An open source, community-driven guide

How the Little Guy Moved

Cogram (YC W22) – Hiring former technical founders

Homeland Security has sent out subpoenas to identify ICE critics

Common Lisp Screenshots: today's CL applications in action

GPT-5.2 derives a new result in theoretical physics

Building a TUI is easy now

Epstein's Ugly World of Science

Font Rendering from First Principles

NPMX – a fast, modern browser for the NPM registry

YouTube as Storage

Backblaze Drive Stats for 2025

Monosketch

How many registers does an x86-64 CPU have? (2020)

Comments