frontpage.

Debating Modern Postgres Architectures: Shared Nothing vs. Shared Everything

2•saisrirampur•2mo ago

Over the past few months, a couple Postgres Bare Metal (NVMe) offerings have been released. I’ve been thinking a lot about shared-nothing (serverless) versus shared-everything (bare-metal) architectures. I also went through the technical architectures of a few existing open-source serverless solutions and gained a good understanding of their internals.

The multi-tenant storage layer provides great flexibility for features like instant provisioning and instant forking, but achieving true serverless inevitably means sacrificing some latency (even with aggressive prefetching). The architecture involves several components, such as the safekeeper (for WAL), page server (for heap), cold storage (S3), and more, each adding some overhead. This results in additional latency across the network, compute (page server), and storage (S3) layers. This also offers the ability to price very low.

With Bare Metal, storage is tightly coupled with compute, and disk access happens on the order of microseconds rather than milliseconds. I view the disk (specially with NVMe) almost as a second-level memory, which can significantly improve performance when a Postgres workload is memory bound leading to dis -access i. However, this architecture offers less flexibility in scaling, storage and compute must scale together, and presents challenges for instant provisioning and forking. That said, one could argue that cloud providers have evolved considerably, offering a broad range of bare-metal instance types.

There’s a user base that finds value in both Serverless and Base Metal architectures. I see serverless as being best suited for customers who prioritize flexibility, aren’t super latency sensitive and don’t anticipate scaling significantly in the near to medium term, where memory or disk might become a bottleneck. In contrast, Bare Metal can greatly benefit workloads that are expected to scale rapidly and where memory or disk performance becomes a critical factor.

Coming to my views on the future, I lean toward shared-everything (Metal) over shared-nothing (serverless), especially for OLTP workloads, where tail latencies really matter, and in contrast to OLAP workloads, every millisecond matters. I find Bare Metal to offer significantly better efficiency (price and performance at scale) while still providing enough flexibility. Metal also stays native to open-source Postgres, no forks or brittle extensions, which means it can keep up with the fast improvements happening in Postgres’ storage layer: async I/O, better checkpoints, vacuum, logical replication, and many other features.

From my experience working with thousands of Postgres customers at Citus, Microsoft, and ClickHouse, memory and disk I/O are always the biggest pain points as they scale. Metal helps address this, enabling customers to scale 2x–10x more efficiently as they grow. This need is even more pressing in today’s AI-driven world, where customers are and will hit memory and disk limits much sooner than before. As AI adoption accelerates and more companies make AI accessible across thousands of verticals, it’s becoming clear that planning for scale from day one is no longer optional, it’s essential.

When it comes to future innovation in this area, one obvious area is making the powerful Metal/NVMe architecture accessible to everyone — through enterprise-grade managed services, which is already starting to take shape. Next, I think the focus will be on overcoming the flexibility challenges of Bare Metal architectures: how do we lower the cost barrier to entry, enable instant provisioning and forking, achieve infinite scale through approaches like sharding, and more.

I squeezed a BERT sentiment analyzer into 1GB RAM on a $5 VPS

Kagi Translate

Building Interactive C/C++ workflows in Jupyter through Clang-REPL [video]

Tactical tornado is the new default

Full-Circle Test-Driven Firmware Development with OpenClaw

Automating Myself Out of My Job – Part 2

Google staff call for firm to cut ties with ICE

Dependency Resolution Methods

Crypto firm apologises for sending Bitcoin users $40B by mistake

Show HN: iPlotCSV: CSV Data, Visualized Beautifully for Free

There's no such thing as "tech" (Ten years later)

List of unproven and disproven cancer treatments

Me/CFS: The blind spot in proactive medicine (Open Letter)

Ask HN: What are the word games do you play everyday?

Show HN: Paper Arena – A social trading feed where only AI agents can post

TOSTracker – The AI Training Asymmetry

The Devil Inside GitHub

Show HN: Distill – Migrate LLM agents from expensive to cheap models

Show HN: Sigma Runtime – Maintaining 100% Fact Integrity over 120 LLM Cycles

Make a local open-source AI chatbot with access to Fedora documentation

Introduce the Vouch/Denouncement Contribution Model by Mitchellh

Software Factories and the Agentic Moment

The Neuroscience Behind Nutrition for Developers and Founders

Bang bang he murdered math {the musical } (2024)

A Night Without the Nerds – Claude Opus 4.6, Field-Tested

Could ionospheric disturbances influence earthquakes?

SpaceX's next astronaut launch for NASA is officially on for Feb. 11 as FAA clea

Show HN: One-click AI employee with its own cloud desktop

Show HN: Poddley – Search podcasts by who's speaking

Same Surface, Different Weight