edit: Never mind, it's down for me now as well.
- SSO issues;
- Google workspace tools not loading;
current time: 2025-07-18T15:35:43+00:00 12h35 GMT-3
Seems to be some hardware problem at least in us-east1
https://status.cloud.google.com/incidents/8cY8jdUpEGGbsSMSQk...
Anyone who says otherwise is selling availability theater
Too many whole-cloud outages due to a bad config in the last 2 months (GCP x2, Cloudflare x2)
Whole-cloud outages are pretty damn rare. The recent GCP issues are an exception to the general rule.
I’d posit that the complexity of a multi-cloud setup is generally going to reduce your service’s reliability more than relying on a single cloud does.
Really?
AWS (EC2) does: https://aws.amazon.com/compute/sla/?did=sla_card&trk=sla_car...
So does GCP (GCE): https://cloud.google.com/compute/sla?hl=en
And so does OVH: https://us.ovhcloud.com/legal/sla/public-cloud/
Are none of those three part of "most clouds"? What cloud platform do you use?
You are correct that it's "better," though, if your goal is to have as many 9s of uptime as possible.
In my current job as a technical due diligence advisor, I frequently recommend a multi-AZ setup but specifically not multi-region. The former is easy and worthwhile, while the latter carries a lot more operational overhead (you become much more sensitive to latency and network jitter) and forces you to think about things like synchronous vs. asynchronous replication. It's much better to focus dev effort on the product than to eke out an additional .001% of availability (unless availability is a super critical component).
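To put the ".001%" figure in perspective, here is a rough back-of-the-envelope sketch of how availability percentages translate into yearly downtime. The function name and the 365-day year are my own assumptions for illustration, not anything taken from a provider's SLA.

```python
# Rough downtime-budget arithmetic; assumes a 365-day year (no leap days).
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600


def downtime_minutes_per_year(availability_pct: float) -> float:
    """Minutes of downtime per year permitted at a given availability percentage."""
    return (100.0 - availability_pct) / 100.0 * MINUTES_PER_YEAR


# Compare a few common "nines" targets.
for pct in (99.9, 99.99, 99.999):
    print(f"{pct}% -> {downtime_minutes_per_year(pct):.1f} min/year")

# What an extra 0.001 percentage points of availability buys over a year.
extra_minutes = 0.001 / 100.0 * MINUTES_PER_YEAR
print(f"+0.001% -> {extra_minutes:.2f} min/year")
```

Under those assumptions, an extra 0.001% works out to a bit over five minutes per year, which is the gain being weighed against the multi-region overhead.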
Probably because it's hard to form long-term memories when you're sleep-deprived :/
B2B customers don’t care if the other sites are also down, your SLA is affected with them, and they will want compensation.
staletofu•14h ago