Ask HN: What did you learn from AWS outages?

2•Brysonbw•2h ago

Comments

merek•32m ago
I figured if a single AZ has an outage, let alone the entire region, I can rest easy knowing much bigger companies will have bigger problems. It will probably be newsworthy, and when customers email in, my excuse will be defensible, since I can send them links to external status pages, news articles, etc.

Whilst this was mostly true, it was still a very unpleasant experience, and my service was hanging by a thread for much of the time. I recently moved an important part of the stack from EC2 to Fargate, with two services: a single task to post jobs to a queue, and another service running many tasks to process jobs from the queue.

The incident knocked out the job posting service, which would not come back up. Had I left it to AWS to resolve automatically, my service would have been out for maybe 12 hours.

Fortunately the worker tasks were still available and waiting. I tracked down the old "job poster" code that used to run on an EC2 instance. I SSHed into an old EC2 instance and "deployed" the code by copying and pasting it onto the server. The service came back up, although I had to edit the code directly on the instance to slow things down, since it had only 1 vCPU and an upgrade was not possible during the incident. Furthermore, the Fargate workers would not scale out when they had too much work.
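For illustration, "slowing things down" on the poster side might look something like the minimal Python sketch below. The original code isn't shown, so everything here is an assumption: the function name `post_jobs_throttled`, the `send` callback (a stand-in for whatever actually pushes a job onto the queue), and the `max_per_second` parameter are all hypothetical.

```python
import time
from typing import Callable, Iterable


def post_jobs_throttled(
    jobs: Iterable[str],
    send: Callable[[str], None],
    max_per_second: float,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> int:
    """Send each job via `send`, pacing calls to at most `max_per_second`.

    `send` is a hypothetical stand-in for the real queue client call.
    Returns the number of jobs sent.
    """
    if max_per_second <= 0:
        raise ValueError("max_per_second must be positive")

    interval = 1.0 / max_per_second
    next_slot = clock()  # earliest time the next job may be sent
    sent = 0

    for job in jobs:
        delay = next_slot - clock()
        if delay > 0:
            sleep(delay)  # wait until the next send slot opens
        send(job)
        sent += 1
        # If we fell behind, schedule from "now" rather than bursting to catch up.
        next_slot = max(next_slot, clock()) + interval

    return sent


if __name__ == "__main__":
    import queue

    q: "queue.Queue[str]" = queue.Queue()  # in-memory stand-in for the real queue
    count = post_jobs_throttled((f"job-{i}" for i in range(5)), q.put, max_per_second=10)
    print(f"posted {count} jobs")
```

Capping the posting rate like this keeps a small poster box from overrunning itself and keeps the backlog within what a fixed pool of workers can drain when they cannot scale out.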

This was at about 2 or 3 AM my time, and was carried out whilst customers were emailing in and CloudWatch alarms were going off all over the place. Once the service was back up, even with my unnerving hacky solution, I got a couple of hours' sleep.

What I've learnt:

- When the incident was first reported, I thought it would last 2 hours max. A 12-16 hour disruption to AWS resources is absolutely possible.

- Maybe don't use us-east-1 for future projects, but I'm not convinced there's much logic to this. Despite past issues, it's impossible to predict where an outage will occur, which resources it will affect, or how far it will spill over into other regions.

- Think of ways to make my service more portable to other regions, or even other cloud providers. That said, the motivation to do this will be gone by tomorrow. It's way more valuable for me to focus on customers, new features, etc., rather than bomb-proofing the service. I don't write airline or medical software. An outage of my service isn't going to kill anyone, and most users are understanding. I'll accept the hit.
