Major Azure Outage
Public clouds will have outages - that's not the point. What's most concerning about this outage is that it is across all regions. That violates the fundamental assumption, of developing for the cloud, that failures in every region are independent.
If regions fail independently and a failure in 1-2 regions brought down my system, that's my fault. But if region failures aren't independent and a global outage such as this is possible - well, that's pretty bad.
Man... Azure seems to be an order of magnitude worse than AWS and GCP when it comes to reliability.
Seems like they have tons of global dependencies within their services which cause these cascading failures rather often... Seems like only a few months ago we were reading about a global outage that affected auth?
Regardless: Godspeed to the engineers working to fix this.
On a more serious note, how would your entire network, worldwide go down? Are there really no independent zones (that are unaware of each other)? That can’t be good.
When I tell my clients Azure had another outage they're going to demand we move to another cloud service. Looks like I'm in for a looong couple weeks.
https://azure.microsoft.com/en-us/status/ was just updated, showing "Network Infrastructure" is down across the board.
Azure SQL is totally down for us, Storage (tables/blobs/queues) is mostly down. Seems to be a DNS issue, and this wouldn’t be the first time Microsoft has been brought down by DNS.
Azure SQL is totally down for us as well. Unable to resolve the DNS for it, both from within azure (kubernetes pods) and from outside (my laptop).
EDIT (23:37 Oslo/Norway): connectivity is restored for us now
I'm seeing some recovery of services in West US
Things have been back up for several minutes.
Recovery started in Europe (AMS)
undefined