frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Cloudflare outage should not have happened

https://ebellani.github.io/blog/2025/cloudflare-outage-should-not-have-happened-and-they-seem-to-be-missing-the-point-on-how-to-avoid-it-in-the-future/
35•b-man•47m ago

Comments

mikece•16m ago
Yes, pretty basic looking mistakes that, from the outside, make many wonder how this got through. Though analyzing the post-mortem makes me think of the MV Dali crashing into the Francis Scott Key bridge in Baltimore: the whole thing started with a single loose wire which set off a cascading failure. CF's situation was similar in a few ways though finding a bad query (and .unwrap() in production code rather than test code) should have been a lot easier to spot.

Have any of the post-mortems addressed if any of the code that led to CloudFlare's outage was generated by AI?

cmckn•14m ago
I agree it should not have happened, but I don’t agree that the database schema is the core problem. The “logical single point of failure” here was created by the rapid, global deployment process. If you don’t want to take down all of prod, you can’t update all of prod at the same time. Gradual deployments are a more reliable defense against bugs than careful programming.
locknitpicker•12m ago
This sort of Monday morning quarterbacking is pointless and only serves as a way for random bloggers to try to grab credit without actually doing or creating any value.
nmoura•11m ago
I disagree. I learnt good stuff from this article and it’s enough.
vessenes•11m ago
"If they had a perfectly normalized database, no NULLing and formally verified code, this bug would not have happened."

That may be. What's not specified there is the immense, immense cost of driving a dev org on those terms. It limits, radically, the percent of engineers you can hire (to those who understand this and are willing to work this way), and it slows deployment radically.

Cloudflare may well need to transition to this sort of engineering culture, but there is no doubt that they would not be in the position they are in if they started with this culture -- they would have been too slow to capture the market.

I think critiques that have actionable plans for real dev teams are likely to be more useful than what, to me, reads as a sort of complaint from an ivory tower. Culture matters, shipping speed matters, quality matters, team DNA matters. That's what makes this stuff hard (and interesting!)

etchalon•9m ago
"This massive, accomplished engineering team whose software operates at a scale nearly no one else operates at missed this basic thing" is a hell of a take.
hvb2•8m ago
> A central database query didn’t have the right constraints to express business rules. Not only it missed the database name, but it clearly needs a distinct and a limit, since these seem to be crucial business rules.

In a database, you wouldn't solve this with a distinct or a limit? You would make the schema guarantee uniqueness?

And yes, that wouldn't deal with cross database queries. But the solution here is just the filter by db name, the rest is table design.

nine_k•5m ago
* The unwrap() in production code should have never passed code review. Damn, it should have been flagged by a linter.

* The deployment should have followed the blue/green pattern, limiting the blast radius of a bad change to a subset of nodes.

* In general, a company so much at the foundational level of internet connectivity should not follow the "move fast, break things" pattern. They did not have an overwhelming reason to hurry and take risks. This has burned a lot of trust, no matter the nature of the actual bug.

tptacek•4m ago
Cloudflare doesn't seem to have called it a "Root Cause Analysis" and, in fact, the term "root cause" doesn't appear to occur in Prince's report. I bring this up because there's a school of thought that says "root cause analysis" is counterproductive: complex systems are always balanced on the precipice of multicausal failure.

Voyager 1 Is About to Reach One Light-Day from Earth

https://scienceclock.com/voyager-1-is-about-to-reach-one-light-day-from-earth/
303•ashishgupta2209•3h ago•97 comments

Cloudflare outage should not have happened

https://ebellani.github.io/blog/2025/cloudflare-outage-should-not-have-happened-and-they-seem-to-...
41•b-man•47m ago•11 comments

OpenAI needs to raise at least $207B by 2030 so it can continue to lose money

https://ft.com/content/23e54a28-6f63-4533-ab96-3756d9c88bad
333•akira_067•2h ago•250 comments

I don't care how well your "AI" works

https://fokus.cool/2025/11/25/i-dont-care-how-well-your-ai-works.html
392•todsacerdoti•7h ago•496 comments

A cell so minimal that it challenges definitions of life

https://www.quantamagazine.org/a-cell-so-minimal-that-it-challenges-definitions-of-life-20251124/
143•ibobev•7h ago•67 comments

Statistical Process Control in Python

https://timothyfraser.com/sigma/statistical-process-control-in-python.html
145•lifeisstillgood•8h ago•43 comments

Optery (YC W22) Hiring CISO, Release Manager, Tech Lead (Node), Full Stack Eng

https://www.optery.com/careers/
1•beyondd•19m ago

Show HN: I turned algae into a bio-altimeter and put it on a weather balloon

https://radi8.dev/blog/stratospore/
23•radeeyate•4d ago•2 comments

Slashdot Effect

https://en.wikipedia.org/wiki/Slashdot_effect
27•firefax•1h ago•24 comments

JOPA: Java compiler in C++, Jikes modernized to Java 6 with Claude

https://github.com/7mind/jopa
11•pshirshov•3d ago•3 comments

Show HN: KiDoom – Running DOOM on PCB Traces

https://www.mikeayles.com/#kidoom
295•mikeayles•19h ago•40 comments

Is DWPD Still a Useful SSD Spec?

https://klarasystems.com/articles/is-dwpd-still-useful-ssd-spec/
33•zdw•5d ago•14 comments

Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

https://arxiv.org/abs/2511.19936
79•50kIters•9h ago•12 comments

Surprisingly, Emacs on Android is pretty good

https://kristofferbalintona.me/posts/202505291438/
197•harryday•3d ago•100 comments

Qiskit open-source SDK for working with quantum computers

https://github.com/Qiskit/qiskit
21•thinkingemote•4h ago•1 comments

Justice dept. requires Realpage end sharing competitively sensitive information

https://www.justice.gov/opa/pr/justice-department-requires-realpage-end-sharing-competitively-sen...
40•phkahler•2h ago•23 comments

Copyparty, the FOSS file server [video]

https://www.youtube.com/watch?v=15_-hgsX2V0
159•franczesko•6d ago•45 comments

I DM'd a Korean presidential candidate and ended up building his core campaign

https://medium.com/@wjsdj2008/i-dmd-a-korean-presidential-candidate-and-ended-up-building-his-cor...
107•wjsdj2009•3h ago•57 comments

Trillions spent and big software projects are still failing

https://spectrum.ieee.org/it-management-software-failures
558•pseudolus•1d ago•517 comments

Efficient solar cooking that stores heat in sand

https://www.sciencedirect.com/science/article/pii/S266711312500035X
62•gsf_emergency_6•2d ago•33 comments

Jakarta is now the biggest city in the world

https://www.axios.com/2025/11/24/jakarta-tokyo-worlds-biggest-city-population
393•skx001•1d ago•309 comments

CS234: Reinforcement Learning Winter 2025

https://web.stanford.edu/class/cs234/
166•jonbaer•16h ago•34 comments

Show HN: We built an open source, zero webhooks payment processor

https://github.com/flowglad/flowglad
344•agreeahmed•23h ago•197 comments

1,700-year-old Roman sarcophagus is unearthed in Budapest

https://apnews.com/article/hungary-roman-sarcophagus-discovery-budapest-77a41fe190bbcc167b43d0514...
120•gmays•1d ago•69 comments

How to repurpose your old phone into a web server

https://far.computer/how-to/
289•louismerlin•3d ago•103 comments

A new bridge links the math of infinity to computer science

https://www.quantamagazine.org/a-new-bridge-links-the-strange-math-of-infinity-to-computer-scienc...
218•digital55•21h ago•123 comments

Launch HN: Onyx (YC W24) – Open-source chat UI

217•Weves•1d ago•142 comments

FLUX.2: Frontier Visual Intelligence

https://bfl.ai/blog/flux-2
341•meetpateltech•1d ago•99 comments

Java Decompiler

http://java-decompiler.github.io
109•mooreds•3d ago•44 comments

Python is not a great language for data science

https://blog.genesmindsmachines.com/p/python-is-not-a-great-language-for
299•speckx•1d ago•279 comments