frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Apache Iceberg V3 Spec new features for more efficient and flexible data lakes

https://opensource.googleblog.com/2025/08/whats-new-in-iceberg-v3.html
47•talatuyarer•3h ago

Comments

talatuyarer•2h ago
This new version has some great new features, including deletion vectors for more efficient transactions and default column values to make schema evolution a breeze. The full article has all the details.
hodgesrm•2h ago
This Google article was nice as a high level overview of Iceberg V3. I wish that the V3 spec (and Iceberg specs in general) were more readable. For now the best approach seems to be read the Javadoc for the Iceberg Java API. [0]

[0] https://javadoc.io/doc/org.apache.iceberg/iceberg-api/latest...

twoodfin•58m ago
The Iceberg spec is a model of clarity and simplicity compared to the (constantly in flux via Databricks commits…) Delta protocol spec:

https://github.com/delta-io/delta/blob/master/PROTOCOL.md

ahmetburhan•1h ago
Cool to see Iceberg getting these kinds of upgrades. Deletion vectors and default column values sound like real quality-of-life improvements, especially for big, messy datasets. Curious to hear if anyone’s tried V3 in production yet and what the performance looks like.
amluto•1h ago
> ALTER TABLE events ADD COLUMN version INT DEFAULT 1;

I’ve always disliked this approach. It conflates two things: the value to put in preexisting rows and the default going forward. I often want to add a column, backfill it, and not have a default.

Fortunately, the Iceberg spec at least got this right under the hood. There’s “initial-default”, which is the value implicitly inserted in rows that predate the addition of the column, and there’s “write-default”, which is the default for new rows.

drivenextfunc•59m ago
Many companies seem to be using Apache Iceberg, but the ecosystem feels immature outside of Java. For instance, iceberg-rust doesn't even support HDFS. (Though admittedly, Iceberg's tendency to create many small files makes it a poor fit for HDFS anyway.)
hodgesrm•34m ago
Seems like this is going to be a permanent issue, no? Library level storage APIs are complex and often quite leaky. That's based on looking at the innards of MySQL and ClickHouse for a while.

It seems quite possible that there will be maybe three libraries that can write to Iceberg (Java, Python, Rust, maybe Golang), while the rest at best will offer read access only. And those language choices will condition and be conditioned by the languages that developers use to write applications that manage Iceberg data.

Wikipedia loses challenge against Online Safety Act

https://www.bbc.com/news/articles/cjr11qqvvwlo
170•phlummox•4h ago•278 comments

I tried every todo app and ended up with a .txt file

https://www.al3rez.com/todo-txt-journey
560•al3rez•6h ago•389 comments

Neki – sharded Postgres by the team behind Vitess

https://planetscale.com/blog/announcing-neki
66•thdxr•2h ago•4 comments

GitHub is no longer independent at Microsoft after CEO resignation

https://www.theverge.com/news/757461/microsoft-github-thomas-dohmke-resignation-coreai-team-transition
694•Handy-Man•4h ago•461 comments

Claude Is the Drug, Cursor Is the Dealer

https://middlelayer.substack.com/p/i-claude-is-the-drug-cursor-is-the
105•logan1085•4h ago•66 comments

OpenSSH Post-Quantum Cryptography

https://www.openssh.com/pq.html
282•throw0101d•8h ago•84 comments

Byte Buddy is a code generation and manipulation library for Java

https://bytebuddy.net/
37•mooreds•3d ago•14 comments

The Joy of Mixing Custom Elements, Web Components, and Markdown

https://deanebarker.net/tech/blog/custom-elements-markdown/
49•deanebarker•4h ago•16 comments

The Value of Institutional Memory

https://timharford.com/2025/05/the-value-of-institutional-memory/
59•leoc•3h ago•27 comments

Pricing Pages – A Curated Gallery of Pricing Page Designs

https://pricingpages.design/
146•finniansturdy•8h ago•42 comments

How Boom uses software to accelerate hardware development

https://bscholl.substack.com/p/move-fast-and-dont-break-safety-critical
31•flabber•1d ago•11 comments

UI vs. API. vs. UAI

https://www.joshbeckman.org/blog/practicing/ui-vs-api-vs-uai
42•bckmn•4h ago•18 comments

Trellis (YC W24) Is Hiring: Automate Prior Auth in Healthcare

https://www.ycombinator.com/companies/trellis/jobs/Cv3ZwXh-forward-deployed-engineers-all-levels-august-2025
1•jackylin•3h ago

Learn, Reflect, Apply, Prepare: The Four Daily Practices That Changed How I Live

https://opuslabs.substack.com/p/learn-reflect-apply-prepare
31•opuslabs•4h ago•3 comments

Claude Code is all you need

https://dwyer.co.za/static/claude-code-is-all-you-need.html
342•sixhobbits•6h ago•214 comments

The Chrome VRP Panel has decided to award $250k for this report

https://issues.chromium.org/issues/412578726
454•alexcos•14h ago•244 comments

White Mountain Direttissima

https://whitemountainski.co/pages/white-mountain-direttissima
18•oftenwrong•3d ago•7 comments

36B solar mass black hole at centre of the Cosmic Horseshoe gravitational lens

https://academic.oup.com/mnras/article/541/4/2853/8213862?login=false
81•bookofjoe•5h ago•57 comments

AP to end its weekly book reviews

https://dankennedy.net/2025/08/08/the-associated-press-tells-its-book-critics-that-its-ending-weekly-reviews/
59•thm•3h ago•20 comments

Launch HN: Halluminate (YC S25) – Simulating the internet to train computer use

26•wujerry2000•5h ago•24 comments

A Guide Dog for the Face-Blind

https://asimov.blog/a-guide-dog-for-the-face-blind/
5•arto•3d ago•1 comments

Porting to OS/2 – GitPius

https://gitpi.us/article-archive/porting-to-os2/
34•rbanffy•4d ago•0 comments

Designing Software in the Large

https://dafoster.net/articles/2025/07/22/designing-software-in-the-large/
52•davidfstr•6h ago•18 comments

Faster substring search with SIMD in Zig

https://aarol.dev/posts/zig-simd-substr/
161•todsacerdoti•10h ago•48 comments

Mistral Integration Improved in Llama.cpp

https://github.com/ggml-org/llama.cpp/pull/14737
70•decide1000•10h ago•3 comments

Token growth indicates future AI spend per dev

https://blog.kilocode.ai/p/future-ai-spend-100k-per-dev
148•twapi•2h ago•119 comments

Apache Iceberg V3 Spec new features for more efficient and flexible data lakes

https://opensource.googleblog.com/2025/08/whats-new-in-iceberg-v3.html
47•talatuyarer•3h ago•7 comments

Optimizing my sleep around Claude usage limits

https://mattwie.se/no-sleep-till-agi
95•mattwiese•19h ago•80 comments

A simple pixel physics simulator in Rust using Macroquad

https://github.com/gale93/sbixel
36•sbirulo•4d ago•1 comments

Ollama and gguf

https://github.com/ollama/ollama/issues/11714
10•indigodaddy•2h ago•4 comments