Distributed DuckDB Instance

44•citguru•2h ago

Comments

citguru•2h ago

This is an attempt to replicate MotherDucks differential storage and implement hybrid query execution on DuckDB

zurfer•1h ago

As someone working in the field I have to admit that I'm not familiar with the terms differential storage nor do I really understand what hybrid execution means. Maybe you could describe it both from a simple technical point of view and what benefits it has to me as a user?

nehalem•1h ago

I have a deep appreciation for DuckDB, but I am afraid the confluence of brilliant ideas makes it ever more complicated to adopt —- and DuckLake is another example for this trend.

When I look at SQLite I see a clear message: a database in a file. I think DuckDb is that, too. But it’s also an analytics engine like Polars, works with other DB engines, supports Parquet, comes with a UI, has two separate warehouse ideas which both deviate from DuckDB‘s core ideas.

Yes, DuckLake and Motherduck are separate entities, but they are still part of the ecosystem.

Lucasoato•1h ago

Last week I’ve sent my first PR in duckdb to support iceberg views in catalogs like Polaris! Let’s hope for the best :)

herpderperator•1h ago

Does this help with DuckDB concurrency? My main gripe with DuckDB is that you can't write to it from multiple processes at the same time. If you open the database in write mode with one process, you cannot modify it at all from another process without the first process completely releasing it. In fact, you cannot even read from it from another process in this scenario.

So if you typically use a file-backed DuckDB database in one process and want to quickly modify something in that database using the DuckDB CLI (like you might connect SequelPro or DBeaver to make changes to a DB while your main application is 'using' it), then it complains that it's locked by another process and doesn't let you connect to it at all.

This is unlike SQLite, which supports and handles this in a thread-safe manner out of the box. I know it's DuckDB's explicit design decision[0], but it would be amazing if DuckDB could behave more like SQLite when it comes to this sort of thing. DuckDB has incredible quality-of-life improvements with many extra types and functions supported, not to mention all the SQL dialect enhancements allowing you to type much more concise SQL (they call it "Friendly SQL"), which executes super efficiently too.

[0] https://duckdb.org/docs/current/connect/concurrency

szarnyasg•1h ago

Hi, DuckDB DevRel here. To have concurrent read-write access to a database, you can use our DuckLake lakehouse format and coordinate concurrent access through a shared Postgres catalog. We released v1.0 yesterday: https://ducklake.select/2026/04/13/ducklake-10/

I updated your reference [0] with this information.

oulipo2•56m ago

Seems cool! But would be nice to have some "real-world" use cases to see actual usage patterns...

In my case my systems can produce "warnings" when there are some small system warning/errors, that I want to aggregate and review (drill-down) from time to time

I was hesitating between using something like OpenTelemetry to send logs/metrics for those, or just to add a "warnings" table to my Timescaledb and use some aggregates to drill them down and possibly display some chunks to review...

but another possibility, to avoid using Timescaledb/clickhouse and just rely on S3 would be to upload those in a parquet file on a bucket through duckdb, and then query them from time to time to have stats

Would you have a recommendation?

Wailbrew – Minimalistic Homebrew GUI Made with Go, Wails and React

Valgrind 3.27 RC1 is out

Operation Paperclip

Audio Flamingo Next: Open audio-language models for speech, sound, and music

Whisk AI

Track Historical GitHub Repo Metrics in Slack and Git

Call Me a Jerk: Persuading AI to Comply with Objectionable Requests

Ransomware Is Growing Three Times Faster Than the Spending Meant to Stop It

Unit 731

No one can force me to have a secure website!!!

Show HN: How unique is your combination of interests among 8B people?

Show HN: I analyzed 591 agentic engineering jobs: LangChain dominates at 22%

Show HN: A CLI that writes its own integration code

Show HN: iOS app that continuously turns contact dates into calendar events

EU-backed manufacturing body goes bust – and no one will say why

I built a bot that tests every interesting HN app daily so I don't have to

Two-Stage Semantic Chunking for RAG in Python

An AI Vibe Coding Horror Story

Compare harnesses not models: Blitzy vs. GPT-5.4 on SWE-Bench Pro

LiquidClash – A native macOS proxy client with Liquid Glass UI

Backblaze has stopped backing up your data

Show HN: A stateful UI runtime for reactive web apps in Go

The Complete Guide to React Native Build Optimization

I turned my Wi-Fi network into a presence sensor

Steven Heller's Font of the Month: Gilway Paradox

Think the Iran war is a disaster? Blame these DC think tanks first

Jarvis – governed AI control plane with receipts, rollback, and agent guardrails

The README for this Java library is something else

Udpown.io – Simple Website Monitoring

The Internet's Most Powerful Archiving Tool Is in Peril