
Show HN: Managed Postgres with native ClickHouse integration

29•saisrirampur•4d ago
Hello HN, this is Sai and Kaushik from ClickHouse. Today we are launching a Postgres managed service that is natively integrated with ClickHouse. It is built together with Ubicloud (YC W24).

TL;DR: NVMe-backed Postgres + built-in CDC into ClickHouse + pg_clickhouse so you can keep your app Postgres-first while running analytics in ClickHouse.

Try it (private preview): https://clickhouse.com/cloud/postgres

Blog w/ live demo: https://clickhouse.com/blog/postgres-managed-by-clickhouse

Problem

Many fast-growing companies using Postgres run into performance and scalability challenges as they grow, for both transactional and analytical workloads. On the OLTP side, common issues include slower ingestion (especially updates and upserts), slower vacuums, and long-running transactions incurring WAL spikes, among others. In most cases, these problems stem from limited disk IOPS and suboptimal disk latency. Without the need to provision or cap IOPS, Postgres could do far more than it does today.

On the analytics side, many limitations stem from the fact that Postgres was designed primarily for OLTP and lacks several features that analytical databases have developed over time, for example vectorized execution, support for a wide variety of ingest formats, etc. We’re increasingly seeing a common pattern where many companies like GitLab, Ramp, Cloudflare etc. complement Postgres with ClickHouse to offload analytics. This architecture enables teams to adopt two purpose-built open-source databases.

That said, if you’re running a Postgres-based application, adopting ClickHouse isn’t straightforward. You typically end up building a CDC pipeline, handling backfills, dealing with schema changes, and updating your application code to be aware of a second database for analytics.

Solution

On the OLTP side, we believe that NVMe-based Postgres is the right fit and can drastically improve performance. NVMe storage is physically colocated with compute, enabling significantly lower disk latency and higher IOPS than network-attached storage, which requires a network round trip for disk access. This benefits disk-throttled workloads and can significantly (up to 10x) speed up operations incl. updates, upserts, vacuums, checkpointing, etc. We are working on a detailed blog examining how WAL fsyncs, buffer reads, and checkpoints dominate on slow I/O and are significantly reduced on NVMe. Stay tuned!
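As a rough back-of-envelope illustration of why disk latency matters for WAL-heavy workloads: a single connection that waits for a WAL flush on every commit cannot commit faster than one transaction per flush. The sketch below only does that arithmetic. The latency values are assumptions picked for illustration, not measurements of this service or of any particular cloud disk, and the model ignores group commit, CPU time, and network time.

```python
# Back-of-envelope: upper bound on commits/sec for ONE connection that
# waits for a WAL flush (fsync) on every commit. Ignores group commit,
# CPU time, and network time, so real numbers will differ.

def max_serial_commits_per_sec(flush_latency_ms: float) -> float:
    """One commit per flush, so the rate is the inverse of the latency."""
    if flush_latency_ms <= 0:
        raise ValueError("flush latency must be positive")
    return 1000.0 / flush_latency_ms


# Hypothetical latencies, chosen only to make the arithmetic concrete.
ASSUMED_FLUSH_LATENCY_MS = {
    "network-attached disk (assumed 1.0 ms)": 1.0,
    "local NVMe (assumed 0.1 ms)": 0.1,
}

if __name__ == "__main__":
    for label, latency_ms in ASSUMED_FLUSH_LATENCY_MS.items():
        rate = max_serial_commits_per_sec(latency_ms)
        print(f"{label}: at most ~{rate:,.0f} commits/sec per connection")
```

Concurrent connections and batching raise the ceiling in practice, but the per-flush latency still sets the floor for how long any single synchronous commit takes.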

On the OLAP side, the Postgres service includes native CDC to ClickHouse and unified query capabilities through pg_clickhouse. Today, CDC is powered by ClickPipes/PeerDB under the hood, which is based on logical replication. We are working to make this faster and easier by supporting logical replication v2 for streaming in-progress transactions, a new logical decoding plugin to address existing limitations of logical replication, working toward sub-second replication, and more.
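For readers new to CDC, here is a minimal conceptual sketch of the "apply" side: an ordered stream of row changes is replayed onto a target table keyed by primary key. The `ChangeEvent` shape and the `apply_changes` helper are hypothetical names made up for this example. This is not how ClickPipes/PeerDB is implemented, and a real pipeline also has to handle backfills, schema changes, and ClickHouse-specific merge semantics.

```python
# Toy model of the "apply" side of CDC: replay an ordered stream of row
# changes onto a target table keyed by primary key. The event shape is
# hypothetical and is NOT the format used by PeerDB or ClickPipes.
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ChangeEvent:
    op: str                     # "insert", "update", or "delete"
    key: int                    # primary key of the affected row
    row: Optional[dict] = None  # new row image; None for deletes


def apply_changes(target: dict, events: list) -> dict:
    """Apply events in commit order; mutates and returns `target`."""
    for ev in events:
        if ev.op in ("insert", "update"):
            if ev.row is None:
                raise ValueError(f"{ev.op} for key {ev.key} has no row image")
            target[ev.key] = dict(ev.row)  # upsert the latest row image
        elif ev.op == "delete":
            target.pop(ev.key, None)       # tolerate a replayed delete
        else:
            raise ValueError(f"unknown op: {ev.op!r}")
    return target


if __name__ == "__main__":
    stream = [
        ChangeEvent("insert", 1, {"id": 1, "status": "new"}),
        ChangeEvent("update", 1, {"id": 1, "status": "paid"}),
        ChangeEvent("insert", 2, {"id": 2, "status": "new"}),
        ChangeEvent("delete", 2),
    ]
    print(apply_changes({}, stream))
```

Treating inserts and updates as the same upsert keeps the apply step idempotent, which is a common design choice when a stream may be replayed after a failure.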

Every Postgres comes packaged with the pg_clickhouse extension, which reduces the effort required to add ClickHouse-powered analytics to a Postgres application. It allows you to query ClickHouse directly from Postgres, enabling Postgres for both transactions and analytics. pg_clickhouse supports comprehensive query pushdown for analytics, and we plan to continuously expand this further (https://news.ycombinator.com/item?id=46249462).

Vision

To sum it up: our vision is to provide a unified data stack that combines Postgres for transactions with ClickHouse for analytics, giving you best-in-class performance and scalability on an open-source foundation.

Get Started

We are actively working with users to onboard them to the Postgres service. Since this is a private preview, it is currently free of cost. If you’re interested, please sign up here: https://clickhouse.com/cloud/postgres

We’d love to hear your feedback on our thesis and anything else that comes to mind. It would be super helpful to us as we build this out!

Comments

scottmas•4d ago
Looks pretty awesome! Especially the native joins between warehouse tables and the OLTP db.

Will pricing likely just be a percent markup over the (excellent) Ubicloud prices they have listed? (https://www.ubicloud.com/docs/about/pricing)

saisrirampur•4d ago
Thank you for chiming in. Pricing is still TBD and will be finalized in the coming months before the service goes to GA. At a high level, we plan to keep it competitive and to make it inclusive of the integration features too (native CDC + pg_clickhouse). Stay tuned!
caffeinated_me•2h ago
It sounds like you're doing something similar to how Databricks works now that they've acquired Neon, or Snowflake now that they got Crunchy. I'm guessing the local SSD is a big advantage, but what else is different with your approach?
saisrirampur•2h ago
Thanks for posting this question! Compared to Snowflake and Databricks, a few key differences in our approach are:

(a) An initial focus on real-time, customer-facing applications rather than trying to boil the ocean. This also aligns with where the Postgres + ClickHouse combination has really shined for our users. Both Postgres and ClickHouse are designed primarily for developers building their system-of-record applications.

(b) Every component in the stack is open source—Postgres, ClickHouse, PeerDB for native CDC, pg_clickhouse, and Ubicloud Postgres (our data plane component). We plan to keep it that way as much as possible, as this strongly aligns with our ethos.

(c) As you noted, Postgres is NVMe-backed and the focus is on performance and scalability, while maintaining top-notch reliability. We think that this is more meaningful to fast-growing (AI-driven) workloads than instant provisioning and forking. I talk about this a bit more here - https://clickhouse.com/blog/postgres-managed-by-clickhouse#p...

caffeinated_me•2h ago
Thanks! Out of curiosity, does the NVMe have a big effect on replication throughput? I've been wondering how much of the trouble I've had with other solutions is due to parsing WAL and how much is just slow cloud disk.
saisrirampur•2h ago
Very interesting question. It depends on the use case. We have seen quite a few workloads where logical replication gets throttled on I/O (reorder buffer), where NVMe-based disk access should help a lot. This happens specifically when there are larger or interleaved transactions. We plan to test this at production scale soon. Stay tuned for more learnings!
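To illustrate why large or interleaved transactions put I/O pressure on logical decoding, here is a toy model of a reorder buffer: changes are held per transaction until commit, and once the buffer exceeds a memory limit the largest in-flight transaction is spilled to disk. The class name and the change-count "memory" accounting are assumptions made for this sketch. It is a simplification of the idea, not Postgres's actual implementation.

```python
# Toy reorder buffer: changes are queued per transaction until commit.
# When the in-memory total exceeds the limit, the largest transaction is
# spilled to "disk". Memory is counted in changes, which is a
# simplification made for this sketch.
from collections import defaultdict


class ToyReorderBuffer:
    def __init__(self, memory_limit: int):
        if memory_limit < 0:
            raise ValueError("memory_limit must be non-negative")
        self.memory_limit = memory_limit
        self.in_memory = defaultdict(list)  # xid -> changes held in memory
        self.spilled = defaultdict(list)    # xid -> changes written to "disk"
        self.spill_writes = 0               # total changes spilled so far

    def _memory_used(self) -> int:
        return sum(len(changes) for changes in self.in_memory.values())

    def _spill_largest(self) -> None:
        xid = max(self.in_memory, key=lambda x: len(self.in_memory[x]))
        changes = self.in_memory.pop(xid)
        self.spilled[xid].extend(changes)
        self.spill_writes += len(changes)

    def add_change(self, xid: int, change: str) -> None:
        self.in_memory[xid].append(change)
        while self._memory_used() > self.memory_limit:
            self._spill_largest()

    def commit(self, xid: int) -> list:
        """Return the transaction's changes in order, reading back spills."""
        return self.spilled.pop(xid, []) + self.in_memory.pop(xid, [])


if __name__ == "__main__":
    rb = ToyReorderBuffer(memory_limit=3)
    # Two interleaved transactions together exceed the limit.
    for xid, change in [(1, "a"), (2, "x"), (1, "b"), (2, "y"), (1, "c")]:
        rb.add_change(xid, change)
    print(rb.commit(1), rb.commit(2), "spilled:", rb.spill_writes)
```

Every spilled change costs a disk write now and a disk read at commit time, which is where slower disks would show up as replication lag in this simplified picture.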
samokhvalov•2h ago
congrats! the more postgres everywhere, the better
