- create your frozen ducklake
- run whatever "normal" mutation query you want (DELETE, UPDATE, MERGE INTO)
- use `ducklake_rewrite_data_files` to make new files with the mutations applied, then optionally run `ducklake_merge_adjacent_files` to compact the files as well (though this might cause all files to change).
- call `ducklake_list_files` to get the new set of active files.
- update your upstream "source of truth" with this new list, optionally deleting any files no longer referenced.
The net result should be that any files "touched" by your updates get new versions written alongside them, while unchanged files are simply returned as-is by the list-files call. A rough sketch of the whole sequence follows below.
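A minimal sketch of that workflow in DuckDB SQL, assuming a DuckLake attached as `lake` (the attach string, data path, table name, and the exact signatures of the `ducklake_*` calls are assumptions; check your DuckLake version's docs):

```sql
-- Attach the DuckLake; catalog file and data path are hypothetical.
ATTACH 'ducklake:catalog.ducklake' AS lake (DATA_PATH 's3://my-bucket/lake/');
USE lake;

-- 1. Run the "normal" mutation.
DELETE FROM events WHERE event_date < DATE '2020-01-01';

-- 2. Physically apply the mutation by rewriting the affected data files,
--    then optionally compact small files (this may touch every file).
CALL ducklake_rewrite_data_files('lake', 'events');
CALL lake.merge_adjacent_files();

-- 3. Fetch the new set of active files to publish to the upstream
--    "source of truth".
SELECT * FROM ducklake_list_files('lake', 'events');
```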
gopalv•2h ago
Similar to how git can serve a repo from a simple HTTP server with no git installed on the server (git update-server-info).
The frozen part is what Iceberg promised in the beginning: a move away from Hive's mutable metastore.
Point to a manifest file plus Parquet/ORC files, and all you need to query it is S3 API calls (there is no metadata/table server; the server is the client).
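That "the server is the client" property is the whole pitch: any DuckDB process that can issue S3 GETs can query the frozen lake directly. A hedged sketch (the attach string, READ_ONLY option, bucket, and table name are all assumptions):

```sql
-- Attach the frozen catalog straight from S3, read-only, and query the
-- Parquet files it references; no metadata service involved.
ATTACH 'ducklake:s3://my-bucket/frozen/catalog.ducklake' AS frozen (READ_ONLY);
SELECT count(*) FROM frozen.events;
```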
> Creating and publishing a Frozen DuckLake with about 11 billion rows, stored in 4,030 S3-based Parquet files took about 22 minutes on my MacBook
Hard to pin down how much of that is CPU and how much is I/O from S3, but doing something like HLL over all the columns and rows is pretty heavy on the CPU.
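For a sense of the per-column cost: DuckDB's `approx_count_distinct` is HyperLogLog-based, so stats collection amounts to something like the query below run over every column, which is CPU-bound once the bytes have been fetched (the path and column names are hypothetical):

```sql
-- HLL-style distinct counts per column; repeat for every column to
-- approximate the work that table statistics collection does.
SELECT
    approx_count_distinct(user_id)    AS distinct_users,
    approx_count_distinct(event_type) AS distinct_event_types
FROM read_parquet('s3://my-bucket/lake/*.parquet');
```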