Show HN: Managed Postgres with native ClickHouse integration

45•saisrirampur•2w ago

Hello HN, this is Sai and Kaushik from ClickHouse. Today we are launching a Postgres managed service that is natively integrated with ClickHouse. It is built together with Ubicloud (YC W24).

TL;DR: NVMe-backed Postgres + built-in CDC into ClickHouse + pg_clickhouse so you can keep your app Postgres-first while running analytics in ClickHouse.

Try it (private preview): https://clickhouse.com/cloud/postgres

Blog w/ live demo: https://clickhouse.com/blog/postgres-managed-by-clickhouse

Problem

Many fast-growing companies that run Postgres hit performance and scalability limits as they grow, for both transactional and analytical workloads. On the OLTP side, common issues include slower ingestion (especially updates and upserts), slower vacuums, and long-running transactions that cause WAL spikes, among others. In most cases, these problems stem from limited disk IOPS and suboptimal disk latency. Without the need to provision or cap IOPS, Postgres could do far more than it does today.

On the analytics side, many limitations stem from the fact that Postgres was designed primarily for OLTP and lacks several features that analytical databases have developed over time, for example vectorized execution, support for a wide variety of ingest formats, etc. We’re increasingly seeing a common pattern where many companies like GitLab, Ramp, Cloudflare etc. complement Postgres with ClickHouse to offload analytics. This architecture enables teams to adopt two purpose-built open-source databases.

That said, if you’re running a Postgres-based application, adopting ClickHouse isn’t straightforward. You typically end up building a CDC pipeline, handling backfills, dealing with schema changes, and updating your application code to be aware of a second database for analytics.

Solution

On the OLTP side, we believe that NVMe-based Postgres is the right fit and can drastically improve performance. NVMe storage is physically colocated with compute, enabling significantly lower disk latency and higher IOPS than network-attached storage, which requires a network round trip for disk access. This benefits disk-throttled workloads and can significantly (up to 10x) speed up operations incl. updates, upserts, vacuums, checkpointing, etc. We are working on a detailed blog examining how WAL fsyncs, buffer reads, and checkpoints dominate on slow I/O and are significantly reduced on NVMe. Stay tuned!
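As a rough back-of-the-envelope sketch of why disk latency matters for commits: if every commit waits for one WAL fsync and nothing is batched, a single connection can commit at most 1/latency times per second. The latencies below are illustrative assumptions, not measurements of this service, and the function name is hypothetical.

```python
def max_commits_per_second(fsync_latency_ms: float) -> float:
    """Upper bound on commits/sec for one connection when each commit
    waits for exactly one WAL fsync (no group commit, no async commit)."""
    if fsync_latency_ms <= 0:
        raise ValueError("latency must be positive")
    return 1000.0 / fsync_latency_ms


# Hypothetical fsync latencies, chosen only for illustration.
ASSUMED_LATENCY_MS = {"network-attached": 1.0, "local NVMe": 0.1}

if __name__ == "__main__":
    for disk, latency in ASSUMED_LATENCY_MS.items():
        bound = max_commits_per_second(latency)
        print(f"{disk}: at most {bound:,.0f} commits/s per connection")
```

Real workloads batch fsyncs across connections (group commit), so this is only an upper bound per connection, but it shows how directly fsync latency caps a write-heavy path.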

On the OLAP side, the Postgres service includes native CDC to ClickHouse and unified query capabilities through pg_clickhouse. Today, CDC is powered by ClickPipes/PeerDB under the hood, which is based on logical replication. We are working to make this faster and easier by supporting logical replication v2 for streaming in-progress transactions, a new logical decoding plugin to address existing limitations of logical replication, working toward sub-second replication, and more.
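To make the CDC idea concrete, here is a toy sketch of the consumer side: row-level change events arrive in commit order and are applied to a target keyed by primary key. The event shape and the names `Change` and `apply_changes` are hypothetical; this is not the ClickPipes/PeerDB format or implementation.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Change:
    op: str                     # "insert", "update" or "delete"
    key: int                    # primary key of the affected row
    row: Optional[dict] = None  # new row image; None for deletes


def apply_changes(target: dict, changes: list) -> dict:
    """Apply changes in the order given and return the updated target."""
    for change in changes:
        if change.op in ("insert", "update"):
            target[change.key] = change.row
        elif change.op == "delete":
            target.pop(change.key, None)
        else:
            raise ValueError(f"unknown op: {change.op}")
    return target


if __name__ == "__main__":
    stream = [
        Change("insert", 1, {"id": 1, "status": "new"}),
        Change("insert", 2, {"id": 2, "status": "new"}),
        Change("update", 1, {"id": 1, "status": "paid"}),
        Change("delete", 2),
    ]
    print(apply_changes({}, stream))
```

Order matters here: replaying the same events in a different order can leave the target in a different state, which is why the stream has to preserve commit order.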

Every Postgres comes packaged with the pg_clickhouse extension, which reduces the effort required to add ClickHouse-powered analytics to a Postgres application. It allows you to query ClickHouse directly from Postgres, enabling Postgres for both transactions and analytics. pg_clickhouse supports comprehensive query pushdown for analytics, and we plan to continuously expand this further (https://news.ycombinator.com/item?id=46249462).
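For readers unfamiliar with query pushdown, here is a generic toy illustration in Python. It does not use or model the pg_clickhouse API; the data and function names are hypothetical. It only shows the general idea: running the aggregate on the remote side means far fewer rows have to travel back.

```python
from collections import Counter

# Hypothetical remote table of (event_type, user_id) rows.
REMOTE_ROWS = [("view", 1), ("view", 2), ("click", 1), ("view", 3), ("click", 2)]


def count_without_pushdown(remote_rows):
    """Fetch every row, then aggregate locally.
    Returns (counts, rows_transferred)."""
    fetched = list(remote_rows)  # every row crosses the wire
    counts = Counter(event for event, _ in fetched)
    return dict(counts), len(fetched)


def count_with_pushdown(remote_rows):
    """Aggregate on the remote side, then fetch only the grouped result.
    Returns (counts, rows_transferred)."""
    remote_result = Counter(event for event, _ in remote_rows)  # runs "remotely"
    fetched = list(remote_result.items())  # one row per group crosses the wire
    return dict(fetched), len(fetched)


if __name__ == "__main__":
    print(count_without_pushdown(REMOTE_ROWS))
    print(count_with_pushdown(REMOTE_ROWS))
```

Both functions return the same counts; the difference is how many rows are transferred, which grows with the table in the first case and with the number of groups in the second.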

Vision

To sum it up, our vision is to provide a unified data stack that combines Postgres for transactions with ClickHouse for analytics, giving you best-in-class performance and scalability on an open-source foundation.

Get Started

We are actively working with users to onboard them to the Postgres service. Since this is a private preview, it is currently free of cost. If you’re interested, please sign up here: https://clickhouse.com/cloud/postgres

We’d love to hear your feedback on our thesis and anything else that comes to mind. It would be super helpful to us as we build this out!

Comments

scottmas•2w ago

Looks pretty awesome! Especially the native joins between warehouse tables and the OLTP db.

Will pricing likely just be a percent markup over the (excellent) Ubicloud prices they have listed? (https://www.ubicloud.com/docs/about/pricing)

saisrirampur•2w ago

Thank you for chiming in. Pricing is still TBD and will be finalized in the coming months before the service goes to GA. At a high level, we plan to keep it competitive and to make it inclusive of the integration features (native CDC + pg_clickhouse). Stay tuned!

caffeinated_me•1w ago

It sounds like you're doing something similar to how Databricks works now that they've acquired Neon, or Snowflake now that they got Crunchy. I'm guessing the local SSD is a big advantage, but what else is different with your approach?

saisrirampur•1w ago

Thanks for posting this question! Compared to Snowflake and Databricks, a few key differences in our approach are:

(a) An initial focus on real-time, customer-facing applications rather than trying to boil the ocean. This also aligns with where the Postgres + ClickHouse combination has really shined for our users. Both Postgres and ClickHouse are designed primarily for developers building their system-of-record applications.

(b) Every component in the stack is open source—Postgres, ClickHouse, PeerDB for native CDC, pg_clickhouse, and Ubicloud Postgres (our data plane component). We plan to keep it that way as much as possible, as this strongly aligns with our ethos.

(c) As you noted, Postgres is NVMe-backed, and the focus is on performance and scalability while maintaining top-notch reliability. We think that this is more meaningful to fast-growing (AI-driven) workloads than instant provisioning and forking. I talk about this a bit more here: https://clickhouse.com/blog/postgres-managed-by-clickhouse#p...

caffeinated_me•1w ago

Thanks! Out of curiosity, does the NVMe have a big effect on replication throughput? I've been wondering how much of the trouble I've had with other solutions is due to parsing WAL and how much is just slow cloud disk.

saisrirampur•1w ago

Very interesting question. It depends on the use case. We have seen quite a few workloads where logical replication gets throttled on I/O (reorder buffer), where NVMe-based disk access should help a lot. This happens specifically when there are larger or interleaved transactions. We plan to test this at production scale soon. Stay tuned for more learnings!
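A toy model of the reorder buffer mentioned above, assuming a simplified WAL made of (xid, kind, payload) tuples. This is not Postgres's implementation and the names are hypothetical. It only shows why interleaved or large transactions force more changes to be held before they can be emitted at commit; in Postgres, changes beyond `logical_decoding_work_mem` are spilled to disk, which is where disk speed comes in.

```python
from collections import defaultdict


def decode(wal):
    """Buffer changes per transaction and emit them at commit.

    wal: list of (xid, kind, payload), kind is "change" or "commit".
    Returns (emitted, peak_buffered): emitted is a list of (xid, payload)
    in commit order; peak_buffered is the most changes held at once.
    """
    pending = defaultdict(list)  # xid -> changes not yet committed
    emitted = []
    buffered = 0
    peak = 0
    for xid, kind, payload in wal:
        if kind == "change":
            pending[xid].append(payload)
            buffered += 1
            peak = max(peak, buffered)
        elif kind == "commit":
            changes = pending.pop(xid, [])
            buffered -= len(changes)
            emitted.extend((xid, p) for p in changes)
        else:
            raise ValueError(f"unknown record kind: {kind}")
    return emitted, peak


if __name__ == "__main__":
    sequential = [(1, "change", "a"), (1, "change", "c"), (1, "commit", None),
                  (2, "change", "b"), (2, "commit", None)]
    interleaved = [(1, "change", "a"), (2, "change", "b"), (1, "change", "c"),
                   (2, "commit", None), (1, "commit", None)]
    print(decode(sequential))
    print(decode(interleaved))
```

The same three changes need a larger buffer when the two transactions overlap, and the effect grows with transaction size and concurrency.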

sakesun•1w ago

Is there a cost disadvantage to being NVMe-backed?

saisrirampur•1w ago

Great question! It really depends on the workload. We already support NVMe instances as small as 4 GB RAM / 2 vCPUs. For HA setups, you could go with one standby (with configurable synchronous replication) or two standbys (cross-AZ, with quorum-based replication). So yes, there is some additional cost from a hardware perspective due to the standbys, but depending on the workload, NVMe performance could offset those costs. On top of this, there’s a separate topic around the reliability/availability promises of separating storage and compute for an OLTP Postgres database.
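As a small sketch of what quorum-based replication means for commit latency: if a commit must be acknowledged by any k of n standbys, the wait is the k-th fastest acknowledgement. In stock Postgres this is configured with something like `synchronous_standby_names = 'ANY 1 (s1, s2)'`; whether the service uses exactly that setting is an assumption, and the latencies below are made up for illustration.

```python
def quorum_commit_wait_ms(ack_latencies_ms, quorum):
    """Time until `quorum` standbys have acknowledged a commit,
    given each standby's acknowledgement latency in milliseconds."""
    if not 1 <= quorum <= len(ack_latencies_ms):
        raise ValueError("quorum must be between 1 and the number of standbys")
    return sorted(ack_latencies_ms)[quorum - 1]


if __name__ == "__main__":
    # Hypothetical cross-AZ acknowledgement latencies for two standbys.
    standbys = [2.0, 5.0]
    print("ANY 1 of 2:", quorum_commit_wait_ms(standbys, 1), "ms")
    print("ANY 2 of 2:", quorum_commit_wait_ms(standbys, 2), "ms")
```

With a quorum of one out of two standbys, a single slow or unavailable standby does not hold up commits, which is the usual reason to pair two standbys with quorum commit.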

samokhvalov•1w ago

congrats! the more postgres everywhere, the better
