I built a whole database around the idea of using the smallest plausible random identifiers, because that seems to be the only "golden disk" we have for universal communication, except for maybe some convergence property of latent spaces in large enough embodied foundation models.
It's weird that they're so underappreciated in the scientific data management and library science communities; many problems that currently require large organisations could have been solved by better identifiers.
To me, the Ship of Theseus question is really about extrinsic (random / named) identifiers vs. intrinsic (hash / embedding) identifiers.
https://triblespace.github.io/triblespace-rs/deep-dive/ident...
https://triblespace.github.io/triblespace-rs/deep-dive/tribl...
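To make the distinction concrete, here's a minimal sketch (my illustration, not triblespace's API): an extrinsic ID is minted independently of the content, while an intrinsic ID is derived from it and changes with every replaced plank.

    import hashlib
    import secrets

    content = b"the ship, plank by plank"

    # Extrinsic: a random name, assigned once; survives any edit to the content.
    extrinsic_id = secrets.token_bytes(16)

    # Intrinsic: derived from the content itself; a new hull means a new identity.
    intrinsic_id = hashlib.sha256(content).digest()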
One upside of the deterministic schemes is that they include provenance/lineage: you can literally trace the path back up through the history to the original ID issuer.
Kinda has me curious: how much information is required to represent an arbitrary provenance tree/graph over a network of N nodes/objects, entirely via the self-describing ID?
(Thinking out loud: in the worst case of a linear chain, if the full provenance must be recoverable from the ID alone, that scales as O(N x local_id_size), so it's quite bad. But assuming the "best case", where any node is expected to be log(N) steps from the root (depth log(N)), global_id_size = log(N) x local_id_size feels like roughly the optimal limit. Since a local ID itself needs about log(N) bits, the global ID effectively grows as log(N)^2. Would that mean that, starting from the article's 399-bit number, a lower limit for a lineage-carrying global_id_size would be something like (400 bits)^2 = 160,000 bits ~= 20 kB? Because it carries the ordered-local-ID provenance information, rather than relying on local shared knowledge.)
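A few lines of Python to check that back-of-envelope arithmetic (the depth model is my assumption above, not anything from the article):

    import math

    LOCAL_ID_BITS = 400  # roughly the article's 399-bit number

    def chain_id_bits(n: int) -> int:
        # Worst case: a linear chain of custody, every ancestor's local ID embedded.
        return n * LOCAL_ID_BITS

    def balanced_id_bits(n: int) -> float:
        # "Best case": depth ~ log2(n), one local ID per level.
        return math.log2(n) * LOCAL_ID_BITS

    # With 2^400 nodes, depth ~ 400, so: 400 * 400 bits = 160,000 bits = 20 kB.
    print(balanced_id_bits(2**400) / 8 / 1000)  # -> 20.0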
Each “post” has a CID, which is a cryptographic hash of the data. To “prove” ownership of the post, there’s a witness hash that is sent that can be proved all the way up the tree to the repo root hash, which is signed with the root key.
Neat way of having data say “here’s the data, and if you care to verify it, here’s an MST”.
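For a feel of how a witness path verifies up to the root, here's a minimal sketch of folding sibling hashes toward a signed root hash; the names and layout are my illustration, not atproto's actual MST encoding.

    import hashlib

    def h(data: bytes) -> bytes:
        return hashlib.sha256(data).digest()

    def verify_witness(leaf: bytes, path: list[tuple[str, bytes]], root: bytes) -> bool:
        # Start from the post's hash, then combine with each sibling hash,
        # tracking which side ("L"/"R") the sibling sits on, up to the repo root.
        node = h(leaf)
        for side, sibling in path:
            node = h(sibling + node) if side == "L" else h(node + sibling)
        return node == root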
(Gotta say here that I love HN. It's one of the very few places where a comment that geeky and pedantic can nonetheless be on point. :-)
From now until protons decay and matter no longer exists is only 10^56 nanoseconds.
10-20 bits: version/epoch
10-20 bits: cosmic region
40 bits: galaxy ID
40 bits: stellar/planetary address
64 bits: local timestamp
This avoids the potentially pathological long provenance chain, and also encodes coordinates into the ID. (A packing sketch follows below.)
Every billion years or so it probably makes sense to re-partition.
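Packing that layout is plain bit arithmetic; here's a sketch assuming 16 bits each for the two variable-width fields (a guess within the 10-20 bit ranges above):

    EPOCH_BITS, REGION_BITS, GALAXY_BITS, BODY_BITS, TIME_BITS = 16, 16, 40, 40, 64

    def pack(epoch: int, region: int, galaxy: int, body: int, t: int) -> int:
        fields = ((epoch, EPOCH_BITS), (region, REGION_BITS),
                  (galaxy, GALAXY_BITS), (body, BODY_BITS), (t, TIME_BITS))
        out = 0
        for value, bits in fields:
            assert 0 <= value < (1 << bits)
            out = (out << bits) | value
        return out  # 176 bits total

A 64-bit local timestamp in nanoseconds rolls over every ~585 years, so "local" really does mean local here.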
Minor correction: Satellites don't go in every direction; they orbit. Probes or spaceships are more appropriate terms.
Timestamp + random seems like it could be a good tradeoff to reduce the ID sizes and still get reasonable characteristics; I'm surprised the article didn't explore that (but then again, "timestamps" are a lot more nebulous at universal scale, I suppose). Just spitballing here, but I wonder if it would be worthwhile to reclaim ten bits of the Snowflake timestamp and use the low 32 bits for a random number. Four billion IDs for each second.
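A sketch of that spitball: dropping the Snowflake timestamp from millisecond to roughly second resolution frees ~10 bits, leaving 31 timestamp bits + 32 random bits in the usual 63-bit budget (the epoch constant here is arbitrary):

    import secrets
    import time

    EPOCH = 1288834974  # seconds; an arbitrary custom epoch (Twitter's, here)

    def spitball_id() -> int:
        seconds = int(time.time()) - EPOCH
        assert seconds < (1 << 31)  # ~68 years of headroom
        # High 31 bits: second-resolution timestamp. Low 32 bits: random.
        return (seconds << 32) | secrets.randbits(32)

One caveat: by the birthday bound, minting ~77k IDs (sqrt(2 ln 2) x 2^16) within the same second already puts you at ~50% collision odds, so this only works if per-second volume stays well below that.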
There's a Tom Scott video [2] that describes YouTube video IDs as 11-character base-64 random numbers, but I don't see any official documentation of that. At the end he says how many IDs are available, but I don't think he accounts for collisions via the birthday paradox.
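The birthday math is quick to run; a sketch, taking 64^11 as the ID space per the video's description:

    import math

    ID_SPACE = 64 ** 11  # about 7.4e19, i.e. 2^66

    def collision_probability(n: int) -> float:
        # Standard birthday approximation: P ~= 1 - exp(-n^2 / 2N).
        return 1 - math.exp(-n * n / (2 * ID_SPACE))

    # ~50% odds of at least one collision after ~10 billion random IDs:
    print(math.sqrt(2 * ID_SPACE * math.log(2)))  # ~1.01e10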
Also, network routing requires objects that have multiple addresses.
The physics side of the whole thing is funny too: afaik quantum particles require fungibility (indistinguishability), i.e. by "doxxing" atoms you unavoidably change the behavior of the system.
adityaathalye•1h ago
Looks like this multispecies universe has a centrally-agreed-upon path addressing system.