3/3/2026 at 11:43:58 AM
I've been working on that for a client since yesterday (as a fractional CTO). Pretty hectic, basically nothing really works and we don't know yet if all data is lost or if anything is recoverable or when AWS UAE will become functional again so we can recover that region.Finally, I have a very good argument for multi-region deployments ;))
that's my go to website atm: https://health.aws.amazon.com/health/status
by RamblingCTO
3/3/2026 at 11:55:49 AM
We didn't do multi-region deployments, but we did store database backups in a separate region just in case something really bad happened and our AWS region became unavailable. Also had a plan/some ready Terraform stuff in order to start setting up a deployment if it became apparent that the region wasn't coming back anytime soon.IMO, if you're using AWS and not replicating your data somewhere else, this should be an eye-opener for you.
by Hamuko
3/3/2026 at 1:56:08 PM
Not sure why everyone read this as me doing anything here, I'm a fractional CTO, which is kind of an advisor. Nothing invaluable will be lost tho. It's not the core platform, just a localized version for specific customers in the region.by RamblingCTO
3/3/2026 at 11:54:41 AM
Severity: DisruptedSo if data won't be recoverable you all will mark it something like "Status: FUBAR" or some equivalent term?
by crossroadsguy
3/3/2026 at 12:13:47 PM
I wonder if ‘Apocalypse’ or ‘Molten Slag’ would be considered professional enough.by lazide
3/3/2026 at 12:14:49 PM
I'd accept "X_X" as status.by bayindirh
3/4/2026 at 2:23:10 AM
Status: Torpedoedby latentsea
3/3/2026 at 12:18:15 PM
FUBAR, I would vote for thatby BonoboIO
3/3/2026 at 12:24:22 PM
What do you mean 'finally' - surely 'redundancy' or 'natural disaster' is reason enough.by richsouth
3/3/2026 at 1:55:25 PM
To a lot of managers/startup execs this is something like "we won't ever need this". And I'd agree to some extent for not so important/rebuildable services. Depends on what you need. In startups, you don't have infinite time you can spend on stuff. But this makes a good case: if a geographical region only has one AWS region, don't keep data or run services that can't be easily rebuild somewhere else. In europe you can just pick two AWS regions and you stay in the same regions, UAE not so much.by RamblingCTO
3/4/2026 at 12:59:59 PM
There’s a reason literally every enterprise of any size in the early 00s not only had a DR site, but a DR plan they would test multiple times per year.1. Just having a copy in two places doesn’t mean the copies are both good or that you can get DR online in any reasonable amount of time.
2. You quickly figure out all the things you thought “weren’t important” that prevent you from actually doing a successful DR test.
They amount of things that “cloud first” people just assume naturally takes care of itself because it’s in the cloud always amuses me.
by tw04
3/4/2026 at 1:22:49 PM
You should have that for your main data, yes. I insist on that as well. Making backups, cold storage, and getting them back online. Yearly is not enough. My point was that not everything needs a backup or a DR. Spend your time wisely, we're talking startups, not enterprise. Can't talk more specifics obviouslyby RamblingCTO
3/4/2026 at 1:54:06 PM
The fact half the internet was down the last time US east was down tells me it’s not just startups that foolishly think they don’t need actively tested dr copies and plans when they move to the cloud.by tw04
3/3/2026 at 1:54:13 PM
I don't think this changes anything. It is always the same argument for me."How often do those happen?"
by doubled112
3/3/2026 at 1:59:00 PM
That was my point: now we have a very specific case we can argue. I always used the "what if a plane crashes into the data center" argument. Now I can say: in one of my engagements, we lost a datacenter completely because of a drone attack. That's a first for AWS as well, but we can draw from reality, not hypotheticals.by RamblingCTO
3/3/2026 at 11:45:07 PM
So AWS's availability zone architecture hasn't helped in this case?by jamesfinlayson
3/4/2026 at 1:03:59 AM
If a whole region is bombed then you wouldn’t expect that region to stay up.by MagicMoonlight
3/4/2026 at 1:21:08 PM
All AZs were affectedby RamblingCTO
3/3/2026 at 1:08:41 PM
You don't already have a DR plan in place?by SirFatty
3/3/2026 at 2:00:04 PM
Of course, no worries. Nothing irreplaceable will be lost ;) It was meant as a general example to be used in future arguments for everybody in tech leadership.by RamblingCTO