frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN submissions tripled and now mostly have the same vibe-coded look

https://www.adriankrebs.ch/blog/design-slop/
98•hubraumhugo•1h ago•56 comments

Windows 9x Subsystem for Linux

https://social.hails.org/@hailey/116446826733136456
530•sohkamyung•5h ago•136 comments

Our eighth generation TPUs: two chips for the agentic era

https://blog.google/innovation-and-ai/infrastructure-and-cloud/google-cloud/eighth-generation-tpu...
179•xnx•3h ago•102 comments

3.4M Solar Panels

https://tech.marksblogg.com/american-solar-farms-v2.html
167•marklit•3h ago•106 comments

Treetops glowing during storms captured on film for first time

https://www.psu.edu/news/earth-and-mineral-sciences/story/treetops-glowing-during-storms-captured...
78•t-3•2h ago•14 comments

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

https://qwen.ai/blog?id=qwen3.6-27b
80•mfiguiere•2h ago•45 comments

GitHub CLI now collects pseudoanonymous telemetry

https://cli.github.com/telemetry
219•ingve•3h ago•175 comments

Columnar Storage Is Normalization

https://buttondown.com/jaffray/archive/columnar-storage-is-normalization/
46•ibobev•3h ago•20 comments

Making RAM at Home [video]

https://www.youtube.com/watch?v=h6GWikWlAQA
499•kaipereira•1d ago•140 comments

ChatGPT Images 2.0

https://openai.com/index/introducing-chatgpt-images-2-0/
954•wahnfrieden•20h ago•833 comments

How does GPS work?

https://perthirtysix.com/how-the-heck-does-gps-work
135•alfanick•6h ago•30 comments

DuckDB 1.5.2 – SQL database that runs on laptop, server, in the browser

https://duckdb.org/2026/04/13/announcing-duckdb-152
38•janandonly•58m ago•4 comments

Kernel code removals driven by LLM-created security reports

https://lwn.net/Articles/1068928/
76•edward•3h ago•58 comments

XOR'ing a register with itself is the idiom for zeroing it out. Why not sub?

https://devblogs.microsoft.com/oldnewthing/20260421-00/?p=112247
124•ingve•9h ago•141 comments

Another Day Has Come

https://daringfireball.net/2026/04/another_day_has_come
87•ndr42•18h ago•80 comments

Monitor your Pi / OMP sessions

https://github.com/BlackBeltTechnology/pi-agent-dashboard
14•ankitg12•3d ago•1 comments

MuJoCo – Advanced Physics Simulation

https://github.com/google-deepmind/mujoco
78•modinfo•3d ago•15 comments

All your agents are going async

https://zknill.io/posts/all-your-agents-are-going-async/
99•zknill•2d ago•61 comments

Prefill-as-a-Service:KVCache of Next-Generation Models Could Go Cross-Datacenter

https://arxiv.org/abs/2604.15039
28•matt_d•3d ago•1 comments

Contact Lens Uses Microfluidics to Monitor and Treat Glaucoma

https://spectrum.ieee.org/smart-contact-lens-glaucoma-microfluidics
80•pseudolus•3d ago•2 comments

Expansion Artifacts

https://mattstromawn.com/writing/expansion-artifacts/
16•tobr•1d ago•1 comments

Drunk post: Things I've learned as a senior engineer (2021)

https://luminousmen.substack.com/p/drunk-post-things-ive-learned-as
235•zdw•15h ago•176 comments

Garbage Collection Without Unsafe Code

https://fitzgen.com/2024/02/06/safe-gc.html
86•foota•3d ago•32 comments

Windows Server 2025 Runs Better on ARM

https://jasoneckert.github.io/myblog/server-2025-arm64/
164•jasoneckert•3d ago•124 comments

The Vercel breach: OAuth attack exposes risk in platform environment variables

https://www.trendmicro.com/en_us/research/26/d/vercel-breach-oauth-supply-chain.html
350•queenelvis•22h ago•114 comments

CATL's new LFP battery can charge from 10 to 98% in less than 7 minutes

https://arstechnica.com/cars/2026/04/catls-new-lfp-battery-can-charge-from-10-to-98-in-less-than-...
99•PotatoNinja•4h ago•43 comments

Nobody Got Fired for Uber's $8M Ledger Mistake?

https://news.alvaroduran.com/p/nobody-got-fired-for-ubers-8-million
88•ohduran•4h ago•64 comments

SpaceX says it has agreement to acquire Cursor for $60B

https://twitter.com/spacex/status/2046713419978453374
743•dmarcos•17h ago•898 comments

Acetaminophen vs. ibuprofen

https://asteriskmag.com/issues/14/the-mystery-in-the-medicine-cabinet
581•nkurz•2d ago•373 comments

Meta to start capturing employee mouse movements, keystrokes for AI training

https://www.reuters.com/sustainability/boards-policy-regulation/meta-start-capturing-employee-mou...
717•dlx•22h ago•476 comments
Open in hackernews

Columnar Storage Is Normalization

https://buttondown.com/jaffray/archive/columnar-storage-is-normalization/
45•ibobev•3h ago

Comments

immanuwell•2h ago
The normalization analogy is genuinely clever as a teaching tool, but it quietly papers over the fact that normalization is a logical design concept while columnar storage is a physical one - treating them as the same thing can mislead more than it clarifies, I think
hilariously•2h ago
Fair, but one of the big benefits of normalization was the benefit on storage and memory back in the day which was tiny comparatively.

There's always a reason for a dev to ship something shitty but when you show you can use 80% less storage for the same operation you can make the accountants your lever.

jklowden•1h ago
Nonsense. See Codd’s first paper.

1NF removes repeating groups, putting for example data for each month in its own row, not an array of 12 months in 1 row.

Storage efficiency was never the point. IMS had that locked down. Succinctness of expression and accuracy of results was the point. And is: normalization prevents anomalous results.

sgarland•1h ago
I think parent was saying it’s a benefit, not the original purpose. If I store a FK to a table containing my company’s corporate address, that is a tremendous savings in storage (and memory pressure), and it also eliminates update anomalies.
cmrdporcupine•9m ago
And other people are saying it may not always have that effect.

Normalization ultimately boils down to breaking your data down into the most elemental and simplest "facts" about them, and its greatest value is how it allows and encourages more flexible and disparate ways of looking at the same information.

HelloNurse•1h ago
Normalizing repeating groups doesn't offer significant savings when they are completely populated (e.g. each entity has the full 12 monthly values per year), but other types of normalization do. For example dependent data are actually redundant.
bazoom42•51m ago
The purpose of normalization is not to save storage. In fact it might often require more storage, since it involves introducing a foreign-key column. It really depends on the data in question whether it saves storage or require more.
hilariously•24m ago
Fair, I said one of the big benefits, not the purpose - in some cases it can require more storage (but that storage is often more amenable to compression) -but generally deduplicating your data doesn't increase your storage needs.
jerf•2h ago
I've always preferred to think of normalization as more about "removing redundancy" than in the frame it is normally presented. Or, to put it another way, rather than "normalizing" which has as a benefit "removing redundancy", raise the removing of redundancy up to the primary goal which has as a side benefit "normalization".

A nice thing about that point of view is that it fits with your point; redundancy is redundancy whether you look at it with a column-based view or a row-based view.

orangepanda•2h ago
Is this meant to be a poor explanation of sixth normal form?
sgarland•1h ago
THANK YOU. I was confused at the normalization example given, and had to think through it. (id, name, age) is already at 5NF, and the only one it doesn’t satisfy is 6NF.
Lucasoato•2h ago
This is an interesting thought, even if it doesn’t come with practical consequences. A person could argue that if you happen to encode your table with a columnar format, you very likely won’t use indexes for every “value” but the order itself of that specific block. But this would mean that if you’re using the data order meaningfully, you’d probably going against the principles of table normalization. But, again, this one as well can be considered the result of excessive overthinking rather something practical that can be used.
parpfish•2h ago
I always thought that the biggest benefit of normalization was deduplicating mutable values so you only need to update values in one place and everything stays nicely in sync.

Classic example being something like a “users” table that tracks account id, display name (mutable), and profile picture (mutable). And then a “posts” table that has post id, account id, and message text. This allows you to change the display name/picture in one place and it can be used across all posts

sgarland•1h ago
That is a practical benefit, absolutely. A different way of looking at it is that you’re eliminating data anomalies (generally, update anomalies).
bazoom42•1h ago
This is usually the case when talking about normalization in the contex of relational databases (2nd normal form, 3rd normal form etc.). But normalization really just means to bring data into some standardized form.
pwndByDeath•1h ago
None-or-many?
juancn•1h ago
It is possible to treat as purely relational but it can be suboptimal on data access if you follow through with it.

The main cost is on the join when you need to access several columns, it's flexible but expensive.

To take full advantage of columnar, you have to have that join usually implicitly made through data alignment to avoid joining.

For example, segment the tables in chunks of up to N records, and keep all related contiguous columns of that chunk so they can be independently accessed:

    r0, r1 ... rm; f0, f0 ... f0; f1, f1 ... f1; fn, fn ... fn
That balances pointer chasing and joining, you can avoid the IO by only loading needed columns from the segment, and skip the join because the data is trivially aligned.
remywang•1h ago
This is exactly domain key normal form!

https://en.wikipedia.org/wiki/Domain-key_normal_form

bazoom42•20m ago
Not exactly. It is 6th normal form.
data-ottawa•51m ago
The Apache Arrow array format docs are a great read if you're interested by this blog post.