frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Beyond Downtime: Architectural Resilience on Hyperscalers

https://cacm.acm.org/blogcacm/beyond-downtime-architectural-resilience-on-hyperscalers/
3•rbanffy•6h ago

Comments

jiggawatts•37m ago
This is a low-value article that reads like it is AI generated even if it’s not.

Almost every instance of downtime I’ve experienced in the cloud was due to a global outage of some sort that no amount of regional redundancy could fix.

Regional redundancy is typically twice as expensive at small scales and decidedly non-trivial to implement because… where do you put your data? At most one region can have low-latency access, all others have to deal with either eventual consistency OR very high latencies! What happens during a network partition? That’s… “fun” to deal with!

Most groups would benefit far more from simply having seamless DevOps deploys and fast rollback.

Neither is available by default in most cloud platforms, you have to build it from fiddly little pieces like off-brand LEGO.

Proprietary pieces with no local dev experience such as syntax validation and emulators.

toast0•13m ago
Certainly big cloud outages involve global outages, and some regional outages cascade into global outages.

But it's pretty common for a major event to happen in a single region. Datacenter fires and/or flooding happen from time to time. Extreme weather can happen. Automatic transfer switches fail from time to time. Fiber cuts happen.

Not everyone needs regional redundancy, and it does add costs, but I don't think it should be dismissed easily. If you're all in on cloudiness, you could have as little as an alternate region replica of your data and your vm images, and be ready to go manually in another region if you need to. Run some tests once or twice a year to confirm your plan works, and to make an estimate for how long it takes to restore service in the event of a regional outage. A few minutes to put up an outage page and an hour or three to restore service is probably fine... Automatic regional failover gets tricky with data consistency and split brain as you mentioned; and hopefully you don't need to do it often.

jiggawatts•9m ago
> But it's pretty common for a major event to happen in a single region.

It's actually pretty rare these days because all major clouds use zone-redundancy and hence their core services are robust to the loss of any single building. Even during the recent Iberian power outages the local cloud sites mostly (entirely?) stayed up.

The outages I've experienced over the last decade(!) were: Global certificate expiry (Azure), Crowdstrike (Windows everywhere), IAM services down globally (AWS), core inter-region router misconfiguration (customer-wide).

None would have been avoided by having more replicas in more places. All of our production systems are already zone-redundant, which is either the default or "just a checkbox" in most clouds.

This article adds no value to the discussion because it states the problem that's not that big a deal, and then doesn't provide any useful solutions for the few people where it is a big deal.

The problem is either easy to solve -- tick the checkbox for zone-redundancy -- or very difficult to solve -- make your app's data globally replicated -- and the article just says "you should do it" without further elaboration.

That's of no value to anyone.

Intel's Lion Cove P-Core and Gaming Workloads

https://chipsandcheese.com/p/intels-lion-cove-p-core-and-gaming
2•zdw•1m ago•0 comments

A non-anthropomorphized view of LLMs

http://addxorrol.blogspot.com/2025/07/a-non-anthropomorphized-view-of-llms.html
1•zdw•1m ago•0 comments

Battle of Vukovar: how 1,800 fighters held off a force of 36,000

https://en.wikipedia.org/wiki/Battle_of_Vukovar
1•felineflock•4m ago•0 comments

Derivative Eigenfunctions

https://www.ryantolsma.com/thoughts/2025/07/06/discrete-derivative.html
1•rtolsma•5m ago•1 comments

Ask HN: How do I buy a typewriter?

2•indus•6m ago•2 comments

Room to Think

https://remarkable.com/roomtothink
1•tmseidman•6m ago•0 comments

mTLS vs. HTTP Message Signatures: Tradeoffs in Securing HTTP Requests

1•getvictor•11m ago•0 comments

Nobody has a personality anymore: we are products with labels

https://www.freyaindia.co.uk/p/nobody-has-a-personality-anymore
1•drankl•12m ago•0 comments

Fines coming for Californians caught by drone with illegal fireworks

https://www.sfgate.com/bayarea/article/california-drones-illegal-fireworks-20629637.php
1•c420•12m ago•0 comments

Code and Trust: Vibrators to Pacemakers

https://punkx.org/jackdoe/code-and-trust.html
1•jackdoe•13m ago•0 comments

New Horizons images enable first test of interstellar navigation

https://www.newscientist.com/article/2486823-new-horizons-images-enable-first-test-of-interstellar-navigation/
1•jnord•16m ago•0 comments

Strategies to Better Resist Distractions

https://www.psychologytoday.com/us/blog/in-practice/202507/3-strategies-to-better-resist-distractions
1•exiguus•20m ago•0 comments

Trump's BBB has $85M to move space shuttle Discovery from Smithsonian to Texas

https://www.space.com/space-exploration/space-shuttle/trumps-signing-of-one-big-beautiful-bill-includes-usd85-million-to-move-space-shuttle-discovery-from-smithsonian-to-texas
4•zzzeek•23m ago•3 comments

The New Corporate Memo: Let AI Ease the Pain

https://gizmodo.com/the-new-corporate-memo-let-ai-ease-the-pain-2000624537
2•rntn•30m ago•0 comments

Record-Breaking Results Bring Fusion Power Closer to Reality

https://www.scientificamerican.com/article/record-breaking-results-bring-fusion-power-closer-to-reality/
2•saubeidl•32m ago•0 comments

iOS app using color filter manipulation

1•camputer_•35m ago•0 comments

Early Triassic super-greenhouse climate driven by vegetation collapse

https://www.nature.com/articles/s41467-025-60396-y
3•benbreen•36m ago•0 comments

The Origin of the Research University

https://asteriskmag.com/issues/10/the-origin-of-the-research-university
1•Petiver•37m ago•0 comments

CSS conditionals with the new if() function

https://developer.chrome.com/blog/if-article
1•Destiner•41m ago•0 comments

Frustrated with my Mac constantly lowering the microphone

https://incubo4u.com/
1•incubo4u•41m ago•0 comments

Building the Rust Compiler with GCC

https://fractalfir.github.io/generated_html/cg_gcc_bootstrap.html
22•todsacerdoti•42m ago•0 comments

'Great Dying' wiped out 90% of life, then came 5M years of lethal heat

https://www.cnn.com/2025/07/02/climate/great-dying-extinction-tipping-point-tropical-forests
4•Bluestein•43m ago•2 comments

Useful Utilities and Toys over DNS

https://www.dns.toys/
1•thunderbong•47m ago•0 comments

Context Engineering

https://blog.langchain.com/context-engineering-for-agents/
2•JnBrymn•50m ago•0 comments

LLMs should not replace therapists

https://arxiv.org/abs/2504.18412
32•layer8•1h ago•16 comments

Why English doesn't use accents

https://www.deadlanguagesociety.com/p/why-english-doesnt-use-accents
19•sandbach•1h ago•1 comments

Show HN: FitmMetr – A privacy-first health tracker built by a CSO

https://fitmetr.app/
1•psvisualdesign•1h ago•1 comments

Agentic Coding – Copilot to Coworker

https://jasondsouza.org/post/agentic-coding
1•jasonrdsouza•1h ago•0 comments

Quantum microtubule substrate of consciousness is experimentally supported

https://pmc.ncbi.nlm.nih.gov/articles/PMC12060853/
2•greyface-•1h ago•0 comments

Show HN: Create video tours of your data for YT Shorts, IG Reels and TikTok

https://github.com/datareels/datareels.github.io
1•aaurelions•1h ago•0 comments