frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Distributed DuckDB Instance

https://github.com/citguru/openduck
53•citguru•3h ago

Comments

citguru•3h ago
This is an attempt to replicate MotherDucks differential storage and implement hybrid query execution on DuckDB
zurfer•2h ago
As someone working in the field I have to admit that I'm not familiar with the terms differential storage nor do I really understand what hybrid execution means. Maybe you could describe it both from a simple technical point of view and what benefits it has to me as a user?
nehalem•2h ago
I have a deep appreciation for DuckDB, but I am afraid the confluence of brilliant ideas makes it ever more complicated to adopt —- and DuckLake is another example for this trend.

When I look at SQLite I see a clear message: a database in a file. I think DuckDb is that, too. But it’s also an analytics engine like Polars, works with other DB engines, supports Parquet, comes with a UI, has two separate warehouse ideas which both deviate from DuckDB‘s core ideas.

Yes, DuckLake and Motherduck are separate entities, but they are still part of the ecosystem.

Lucasoato•2h ago
Last week I’ve sent my first PR in duckdb to support iceberg views in catalogs like Polaris! Let’s hope for the best :)
herpderperator•2h ago
Does this help with DuckDB concurrency? My main gripe with DuckDB is that you can't write to it from multiple processes at the same time. If you open the database in write mode with one process, you cannot modify it at all from another process without the first process completely releasing it. In fact, you cannot even read from it from another process in this scenario.

So if you typically use a file-backed DuckDB database in one process and want to quickly modify something in that database using the DuckDB CLI (like you might connect SequelPro or DBeaver to make changes to a DB while your main application is 'using' it), then it complains that it's locked by another process and doesn't let you connect to it at all.

This is unlike SQLite, which supports and handles this in a thread-safe manner out of the box. I know it's DuckDB's explicit design decision[0], but it would be amazing if DuckDB could behave more like SQLite when it comes to this sort of thing. DuckDB has incredible quality-of-life improvements with many extra types and functions supported, not to mention all the SQL dialect enhancements allowing you to type much more concise SQL (they call it "Friendly SQL"), which executes super efficiently too.

[0] https://duckdb.org/docs/current/connect/concurrency

szarnyasg•1h ago
Hi, DuckDB DevRel here. To have concurrent read-write access to a database, you can use our DuckLake lakehouse format and coordinate concurrent access through a shared Postgres catalog. We released v1.0 yesterday: https://ducklake.select/2026/04/13/ducklake-10/

I updated your reference [0] with this information.

oulipo2•1h ago
Seems cool! But would be nice to have some "real-world" use cases to see actual usage patterns...

In my case my systems can produce "warnings" when there are some small system warning/errors, that I want to aggregate and review (drill-down) from time to time

I was hesitating between using something like OpenTelemetry to send logs/metrics for those, or just to add a "warnings" table to my Timescaledb and use some aggregates to drill them down and possibly display some chunks to review...

but another possibility, to avoid using Timescaledb/clickhouse and just rely on S3 would be to upload those in a parquet file on a bucket through duckdb, and then query them from time to time to have stats

Would you have a recommendation?

decide1000•33m ago
I built a distributed DuckDB setup using OpenRaft for state replication. Every node holds a full copy of the database. Writes go through Raft consensus, reads are local. It's more like etcd-with-DuckDB than MotherDuck-lite.

OpenDuck takes a different approach with query federation with a gateway that splits execution across local and remote workers. My use case requires every node to serve reads independently with zero network latency, and to keep running if other nodes go down.

The PostgreSQL dependency for metadata feels heavy. Now you're operating two database systems instead of one. In my setup DuckDB stores both the Raft log and the application data, so there's a single storage engine to reason about.

Not saying my approach is universally better. If you need to query across datasets that don't fit on a single machine, OpenDuck's architecture makes more sense. But if you want replicated state with strong consistency, Raft + DuckDB works very well.

arpinum•22m ago
I read the code. It's a good case study of one-shot output from AI when you ask it to replicate a SaaS product. This is probably better than most because MotherDuck has been open about their techniques to build the product.

Obviously not a production implementation.

DaVinci Resolve – Photo

https://www.blackmagicdesign.com/products/davinciresolve/photo
556•thebiblelover7•7h ago•136 comments

A new spam policy for “back button hijacking”

https://developers.google.com/search/blog/2026/04/back-button-hijacking
344•zdw•6h ago•208 comments

An AI Vibe Coding Horror Story

https://www.tobru.ch/an-ai-vibe-coding-horror-story/
90•teichmann•1h ago•57 comments

Backblaze has stopped backing up your data

https://rareese.com/posts/backblaze/
38•rrreese•1h ago•13 comments

Introspective Diffusion Language Models

https://introspective-diffusion.github.io/
32•zagwdt•1h ago•7 comments

Someone bought 30 WordPress plugins and planted a backdoor in all of them

https://anchor.host/someone-bought-30-wordpress-plugins-and-planted-a-backdoor-in-all-of-them/
940•speckx•15h ago•265 comments

GitHub Stacked PRs

https://github.github.com/gh-stack/
711•ezekg•13h ago•374 comments

The secrets of the Shinkansen

https://www.worksinprogress.news/p/the-secret-behind-japans-railways
76•WillDaSilva•3h ago•56 comments

Distributed DuckDB Instance

https://github.com/citguru/openduck
53•citguru•3h ago•9 comments

Lean proved this program correct; then I found a bug

https://kirancodes.me/posts/log-who-watches-the-watchers.html
247•bumbledraven•9h ago•119 comments

WiiFin – Jellyfin Client for Nintendo Wii

https://github.com/fabienmillet/WiiFin
158•throwawayk7h•10h ago•69 comments

A soft robot has no problem moving with no motor and no gears

https://engineering.princeton.edu/news/2026/04/08/soft-robot-has-no-problem-moving-no-motor-and-n...
30•hhs•4d ago•3 comments

Multi-Agentic Software Development Is a Distributed Systems Problem

https://kirancodes.me/posts/log-distributed-llms.html
35•tie-in•4h ago•8 comments

MOS tech 6502 8-bit microprocessor in pure SQL powered by Postgres

https://github.com/lasect/pg_6502
19•adunk•3h ago•2 comments

Design and implementation of DuckDB internals

https://duckdb.org/library/design-and-implementation-of-duckdb-internals/
121•mpweiher•3d ago•8 comments

Nothing Ever Happens: Polymarket bot that always buys No on non-sports markets

https://github.com/sterlingcrispin/nothing-ever-happens
418•m-hodges•18h ago•234 comments

TanStack Start Now Support React Server Components

https://tanstack.com/blog/react-server-components
58•polywock•4h ago•44 comments

Roblox devs now need a subscription to share their games freely

https://devforum.roblox.com/t/new-publishing-requirements-evaluation-process-for-games/4573166
50•hallole•3h ago•45 comments

Rust Threads on the GPU

https://www.vectorware.com/blog/threads-on-gpu/
70•PaulHoule•4d ago•21 comments

US appeals court declares 158-year-old home distilling ban unconstitutional

https://nypost.com/2026/04/11/us-news/us-appeals-court-declares-158-year-old-home-distilling-ban-...
385•t-3•20h ago•257 comments

Lumina – a statically typed web-native language for JavaScript and WASM

https://github.com/nyigoro/lumina-lang
10•light_ideas•4d ago•3 comments

N-Day-Bench – Can LLMs find real vulnerabilities in real codebases?

https://ndaybench.winfunc.com
69•mufeedvh•11h ago•18 comments

Make tmux pretty and usable (2024)

https://hamvocke.com/blog/a-guide-to-customizing-your-tmux-conf/
380•speckx•18h ago•235 comments

Anastasia (1997) live action reference material

https://lostmediawiki.com/Anastasia_(partially_found_live-action_reference_material_for_Don_Bluth...
26•hyperific•3d ago•3 comments

How to make Firefox builds 17% faster

https://blog.farre.se/posts/2026/04/10/caching-webidl-codegen/
180•mbitsnbites•14h ago•34 comments

Repository Pattern with Hygienic Macros in Scheme – Lisp

https://jointhefreeworld.org/blog/articles/lisps/functional-repository-pattern-in-scheme-with-mac...
8•jjba23•3d ago•0 comments

Write less code, be more responsible

https://blog.orhun.dev/code-responsibly/
90•orhunp_•3d ago•56 comments

Building a CLI for all of Cloudflare

https://blog.cloudflare.com/cf-cli-local-explorer/
308•soheilpro•18h ago•100 comments

Android now stops you sharing your location in photos

https://shkspr.mobi/blog/2026/04/android-now-stops-you-sharing-your-location-in-photos/
369•edent•21h ago•297 comments

I just want simple S3

https://blog.feld.me/posts/2026/04/i-just-want-simple-s3/
178•g0xA52A2A•2d ago•89 comments