frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Iceberg, the right idea – the wrong spec – Part 2 of 2: The spec

https://www.database-doctor.com/posts/iceberg-is-wrong-2.html
20•lsuresh•6h ago

Comments

ozgrakkurt•2h ago
Great analysis of what iceberg does but don’t agree with so much criticism.

It is very basic compared to a database, and even when you go into details of databases there are many things that don’t make sense in terms of doing the absolute best thing.

You could ciritisize parquet in a similar way if you go through the spec but because it is open and so popular people are going to use it no matter what.

If you need more performance/efficiency simplicity etc. just don’t use parquet but have conversion between your format and parquet.

Or you can build on top of parquet with external indices, keeping metadata in memory and having a separate WAL for consistency.

Similarly it should be possible to build on top of iceberg spec to create something like a db server that is efficient.

It is unlikely for something so usable for so many use cases to be the technically pure and most sensible option.

dkdcio•1h ago
I think this criticism is missing the order of magnitude aspect -- I agree, people do not choose the most technically pure option. But one that launches on day 1 that can be used in SQL or Python with a few lines of code, across any cloud provider, and it basically "just works" is an order of magnitude or more simple than using Iceberg, at least in my experience in Python. It's always been odd how every non-JVM client for Iceberg has supported reads, but never writes...

People don't choose on tech on technical purity, but they often chose on simplicity & ease of use

lsuresh•59m ago
Yeah that's been our biggest issue in this ecosystem (the non-JVM clients). They can't do writes and are often far behind on feature parity with the blessed JVM clients.
fifilura•1h ago
I am currently considering whether it is worth moving our stack from Hive type tables to Iceberg. Iceberg is obviously technically more competent, but the Hive tables are just so nice because the data is almost orthogonal from the tables.

You can throw away a table and recreate it in minutes and vice versa you can edit the data and the table will adapt.

I am so used to this and I am worried of loosing this flexibility with Iceberg.

Maybe a mix is the way to go.

TFA is very well written by the way. From my perspective I see Iceberg as Hive tables 2.0. Solving a lot of the Hive related problems but not all generic database problems. So all new features are positive for me.

But my only gripe is - is the added complexity worth it?

chojeen•35m ago
I really don't get a lot of this criticism. For example, who is using iceberg with hundreds of concurrent committers, especially at the scale mentioned in the article (10k rows per second)? Using iceberg or any table format over object storage would be insane in that case. But for your typical spark application, you have one main writer (the spark driver) appending or merging a large number of records in > 1 minute microbatches and maybe a handful of maintenance jobs for compaction and retention; Iceberg's concurrency system works fine there.

If you have any use case like one the author describes, maybe use an in-memory cloud database with tiered storage or a plain RDBMS. Iceberg (and similar formats) work great for the use cases for which they're designed.

Telo MT1

https://www.telotrucks.com/
239•turtleyacht•4h ago•181 comments

6 Weeks of Claude Code

https://blog.puzzmo.com/posts/2025/07/30/six-weeks-of-claude-code/
128•mike1o1•2d ago•180 comments

Helsinki records zero traffic deaths for full year

https://www.helsinkitimes.fi/finland/finland-news/domestic/27539-helsinki-records-zero-traffic-deaths-for-full-year.html
220•DaveZale•3d ago•113 comments

The Art of Multiprocessor Programming 2nd Edition Book Club

https://eatonphil.com/2025-art-of-multiprocessor-programming.html
201•eatonphil•7h ago•28 comments

I tried living on IPv6 for a day, and here's what happened

https://www.xda-developers.com/the-internet-isnt-fully-ipv6-ready/
24•speckx•2d ago•12 comments

We may not like what we become if A.I. solves loneliness

https://www.newyorker.com/magazine/2025/07/21/ai-is-about-to-solve-loneliness-thats-a-problem
289•defo10•10h ago•618 comments

Browser extension and local backend that automatically archives YouTube videos

https://github.com/andrewarrow/starchive
76•fcpguru•4h ago•29 comments

Anandtech.com now redirects to its forums

https://forums.anandtech.com/
60•kmfrk•7h ago•13 comments

Online Collection of Keygen Music

https://keygenmusic.tk
127•mifydev•3d ago•29 comments

Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models

https://arxiv.org/abs/2507.12547
4•PaulHoule•38m ago•0 comments

At a Loss for Words: A flawed idea is teaching kids to be poor readers (2019)

https://www.apmreports.org/episode/2019/08/22/whats-wrong-how-schools-teach-reading
28•Akronymus•8h ago•37 comments

Helion begins work on Washington nuclear fusion plant

https://www.nucnet.org/news/microsoft-backed-fusion-company-begins-work-on-washington-nuclear-fusion-plant-7-4-2025
34•mpweiher•2d ago•28 comments

PixiEditor 2.0 – A FOSS universal 2D graphics editor

https://pixieditor.net/blog/2025/07/30/20-release/
70•ksymph•2d ago•7 comments

Great Question (YC W21) Is Hiring a VP of Engineering (Remote)

https://www.ycombinator.com/companies/great-question/jobs/ONBQUqe-vp-of-engineering
1•nedwin•3h ago

Show HN: WebGPU enables local LLM in the browser – demo site with AI chat

https://andreinwald.github.io/browser-llm/
93•andreinwald•6h ago•35 comments

The /o in Ruby regex stands for "oh the humanity "

https://jpcamara.com/2025/08/02/the-o-in-ruby-regex.html
95•todsacerdoti•6h ago•23 comments

Compressing Icelandic name declension patterns into a 3.27 kB trie

https://alexharri.com/blog/icelandic-name-declension-trie
182•alexharri•9h ago•69 comments

Double-slit experiment holds up when stripped to its quantum essentials

https://news.mit.edu/2025/famous-double-slit-experiment-holds-when-stripped-to-quantum-essentials-0728
26•ColinWright•2d ago•9 comments

Australia’s gains in wheat-farm productivity

https://www.reuters.com/investigations/less-rain-more-wheat-how-australian-farmers-defied-climate-doom-2025-07-29/
44•tiarafawn•3d ago•2 comments

Financial lessons from my family's experience with long-term care insurance

https://www.whitecoatinvestor.com/financial-lessons-father-long-term-care-insurance/
84•wallflower•6h ago•87 comments

Linear Types for Programmers (2023)

https://twey.io/for-programmers/linear-types/
30•marvinborner•4h ago•4 comments

A.I. researchers are negotiating $250M pay packages

https://www.nytimes.com/2025/07/31/technology/ai-researchers-nba-stars.html
119•jrwan•9h ago•206 comments

ThinkPad designer David Hill on unreleased models

https://www.theregister.com/2025/08/02/thinkpad_david_hill_interview/
129•LorenDB•8h ago•53 comments

A dive into open chat protocols

https://wiki.alopex.li/ADiveIntoOpenChat
56•Bogdanp•3d ago•8 comments

The Rubik's Cube Perfect Scramble (2024)

https://www.solutionslookingforproblems.com/post/the-rubik-s-cube-perfect-scramble
78•notagoodidea•6h ago•22 comments

The Big Oops in type systems: This problem extends to FP as well

https://danieltan.weblog.lol/2025/07/the-big-oops-in-type-systems-this-problem-extends-to-fp-as-well
36•ksymph•2d ago•10 comments

TclSqueak – Program in Tcl the Smalltalk Way

http://www.xdobry.de/tclsqueak/
5•ofalkaed•2d ago•0 comments

Introduction to Unikernel: Building, deploying lightweight, secure applications

https://tallysolutions.com/technology/introduction-to-unikernel-2/
14•eyberg•1d ago•7 comments

Write "Freehold" Software

https://deadbeef.io/freehold_software
42•rjinman•1w ago•15 comments

Show HN: Wordle-style game for Fermi questions

https://www.fermiquestions.org/
22•danielfetz•3h ago•23 comments