We recovered from nightmare Postgres corruption on the matrix.org homeserver

https://matrix.org/blog/2025/07/postgres-corruption-postmortem/
18•Arathorn•7h ago

Comments

fowl2•5h ago
Seems like there are a few places where Postgres could benefit from more consistency checks.
Arathorn•4h ago
We could have run with data checksums (https://www.postgresql.org/docs/current/app-pgchecksums.html) turned on, but that slows things down a bunch, and turning them on in retrospect would have taken days. Also, it's not clear that checksums would have caught whatever the underlying corruption was here…
anarazel•2h ago
Easier said than done in this case. Actually effective crosschecks preventing this issue from occurring would entail rather massive I/O and CPU amplification in common operations.
anarazel•2h ago
A few questions:

- Are you using pg_repack? I'm fairly sure its logic has some holes. Last time I checked, its bug tracker listed potential for data corruption that could cause issues like this.

- Have you done OS upgrades? Did affected indexes have any columns affected by collations?

- Have you done analysis on the heap page? E.g. is there any valid data on the page? What is the page's LSN compared to the LSN on index pages pointing to non-existing tuples on the page?
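
To make the LSN comparison in the last question concrete, here is a minimal sketch. It assumes you have already pulled the page LSNs out as text (for example with the pageinspect extension's page_header(get_raw_page(...))); the function names and the sample LSN values are hypothetical. The point is only that LSNs print as two hex halves, so they need to be compared numerically rather than as strings.

```python
def parse_lsn(text: str) -> int:
    """Convert a PostgreSQL LSN such as '16/B374D848' into a 64-bit integer.

    The text form is two hexadecimal numbers separated by a slash:
    the high 32 bits, then the low 32 bits.
    """
    high, low = text.split("/")
    return (int(high, 16) << 32) | int(low, 16)


def index_newer_than_heap(heap_lsn: str, index_lsn: str) -> bool:
    """Return True if the index page was last modified after the heap page.

    Hypothetical helper: both arguments are LSN strings read from the
    page headers of the heap page and of an index page pointing into it.
    """
    return parse_lsn(index_lsn) > parse_lsn(heap_lsn)


if __name__ == "__main__":
    # Made-up LSNs, purely for illustration.
    heap = "9/FFFFFFF0"
    index = "10/00000010"
    # A plain string comparison would get this wrong ("9..." > "10...").
    print(index_newer_than_heap(heap, index))
```

If index pages carry a noticeably newer LSN than the heap page they point into, that might hint at a heap write that never reached disk, though on its own it proves nothing.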

dap•10m ago
The post appears to conclude that this must be a hardware issue because they have no explanation and PostgreSQL and the kernel are too reliable to have data corruption bugs. I've seen data corruption bugs in both databases and the kernel (as well as CPUs, for that matter), so I'm pretty skeptical of that explanation.

When something "can't happen" in your program, it makes sense to look at the layers below. Unfortunately, this often goes one of two ways: you ask people for help and they tell you that it's never one of the layers below ("it's never a compiler bug") or you stop at the conclusion "well, I guess the layer below [kernel/TCP/database/etc.] gave us corrupted data". The conclusion in this post kind of does both of these things. Of course, sometimes it _is_ a bug in one of those layers. But stopping there is no good either, especially when the application itself is non-trivial and you have no evidence that a lower layer is at fault.

People often treat a hypothesis like "the disk corrupted the data" as unfalsifiable. After the fact, that might be true, given the stack you're using. But that doesn't have to be the case. If you ran into a problem like this on ZFS, for example, you'd have very high confidence about whether the disk was at fault (because it can reliably detect when the disk returns data different from what ZFS wrote to it). I realize a lot goes into choosing a storage stack and maybe ZFS doesn't make sense for them. But if the hypothesis is that such a severe issue resulted from a hardware/firmware failure, I'd look pretty hard at deploying a stack that can reliably identify such failures. At the very least, if you see this again, you'll either know for sure it was the disk or you'll have high confidence that there's a software bug lurking elsewhere. Then you can add similar kinds of verification at different layers of the stack to narrow down the problem. In an ideal world, all the software should be able to help exonerate itself.
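
As a rough illustration of what "verification at a layer" means here, the sketch below wraps a block store so that a checksum is recorded on every write and checked on every read. Everything in it is an assumption for illustration: the ChecksummedStore class is hypothetical, the in-memory dicts stand in for a disk, and CRC32 stands in for whatever checksum a real system such as ZFS would use. It is not how ZFS or PostgreSQL implement this.

```python
import zlib


class CorruptBlockError(Exception):
    """Raised when a block read back does not match its recorded checksum."""


class ChecksummedStore:
    """Toy block store that can tell when the layer below returns bad data."""

    def __init__(self):
        self._blocks = {}     # block id -> bytes (stand-in for the disk)
        self._checksums = {}  # block id -> CRC32, kept apart from the data

    def write(self, block_id: int, data: bytes) -> None:
        # Record the checksum of exactly what we asked the "disk" to store.
        self._blocks[block_id] = data
        self._checksums[block_id] = zlib.crc32(data)

    def read(self, block_id: int) -> bytes:
        data = self._blocks[block_id]
        # If the bytes coming back differ from what was written, say so
        # loudly instead of handing corrupt data to the layer above.
        if zlib.crc32(data) != self._checksums[block_id]:
            raise CorruptBlockError(f"block {block_id} failed checksum")
        return data


if __name__ == "__main__":
    store = ChecksummedStore()
    store.write(0, b"hello world")
    store._blocks[0] = b"hello w0rld"  # simulate the disk flipping a byte
    try:
        store.read(0)
    except CorruptBlockError as exc:
        print("detected:", exc)
```

With something like this in place, a bad read turns into evidence about which layer misbehaved rather than a guess made after the fact.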
