Puzzle Technology
← All incidents

Active – Multi Service degradation in West US 2

Major · resolved · started 29 May 2026, 05:27 · resolved 29 May 2026, 22:40

  1. resolved · 29 May 2026, 22:40

    Auto-resolved by the status page: the vendor stopped reporting this incident in their feed.

  2. investigating · 29 May 2026, 19:22

    Impact Statement: Starting at 04:27 UTC on 29 May 2026, a severe thunderstorm caused widespread utility power loss to our West US 2 datacenter facilities, resulting in a multi-service outage. Datacenter power and network infrastructure have been fully restored. The majority of services have recovered, with residual impact limited to storage-dependent workloads on two remaining storage stamps undergoing tail recovery. Customers with dependencies on two outstanding storage stamps may still experience intermittent connectivity or elevated latency. All other services are operating normally or nearing full restoration. Currently affected Azure services include, but are not limited to (recovery validation underway): Azure Functions, Azure Database for MySQL Flexible Servers, Azure Database for PostgreSQL Flexible Server, Azure SQL, Azure Managed Grafana, Virtual Machines, Virtual Machine Scale Sets, Azure Kubernetes Service, Storage, Application Insights, Azure Monitor, and Azure Log Analytics Current Status: A severe weather event, including multiple lightning strikes, affected several datacenter facilities in the West US 2 region, resulting in concurrent utility power interruptions across multiple availability zones. Backup generators activated as designed; however, during the transition to sustained generator operation, a subset of generator systems were unable to fully synchronize under the sudden facility load, while others subsequently shut down due to thermal protection mechanisms as cooling systems were impacted by the broader power disruption. These events occurred across multiple facilities within the same timeframe, thus resulting in multiple zones being impacted concurrently. Datacenter utility power has been fully restored, and network infrastructure is fully operational. HVAC systems have returned to normal operation, and ambient temperatures are within expected ranges. The majority of storage infrastructure has recovered successfully, with two storage stamps currently completing final recovery and integrity validation activities. On-site teams continue to perform validation and bring the remaining resources online. Services dependent on these storage stamps are expected to recover as these final activities are completed. The following services have been confirmed restored and are operating normally: Service Bus, App Service (Web Apps), Azure Site Recovery, Backup (MAB), Azure Cosmos DB, Azure Resource Manager, Data Explorer, Azure IoT Hub, Microsoft Defender for Cloud Apps, Azure Container Registry, Azure Policy, Azure NetApp Files, Azure Resource Graph, Azure Data Factory, Azure Databricks, Redis, and Azure Synapse Estimated Time to Resolution: With datacenter power back online, our network infrastructure fully restored and with majority of storage back online, remaining recovery is limited to two storage stamps completing the final stages of recovery and data integrity checks. We estimate full service restoration within approximately 3–4 hours. Individual services dependent on these stamps will come back online progressively as validation completes. Customer Guidance: If you have resources in paired or alternate regions, consider failing over traffic away from West US 2 until we confirm full recovery. Single-region workloads will recover automatically as infrastructure is restored — no customer action is required. We recommend pausing new deployments to West US 2 until this incident is resolved; use alternate regions if urgent. Monitor Azure Status (https://azure.status.microsoft/en-us/status) and Service Health in the portal for real-time updates specific to your subscriptions. Next update: We will provide our next status update within the next 60 minutes, or sooner if events warrant.

  3. investigating · 29 May 2026, 19:14

    Impact Statement: Starting at 04:27 UTC on 29 May 2026, a severe thunderstorm caused widespread utility power loss to our West US 2 datacenter facilities, resulting in a multi-service outage. Datacenter power and network infrastructure have been fully restored. The majority of services have recovered, with residual impact limited to storage-dependent workloads on two remaining storage stamps undergoing tail recovery. Customers with dependencies on two outstanding storage stamps may still experience intermittent connectivity or elevated latency. All other services are operating normally or nearing full restoration. Currently affected Azure services include, but are not limited to (recovery validation underway): Azure Functions, Azure Database for MySQL Flexible Servers, Azure Database for PostgreSQL Flexible Server, Azure Databricks, Redis, Azure SQL, Azure Managed Grafana, Virtual Machines, Virtual Machine Scale Sets, Azure Kubernetes Service, Storage, Application Insights, Azure Monitor, and Azure Log Analytics Current Status: A severe weather event, including multiple lightning strikes, affected several datacenter facilities in the West US 2 region, resulting in concurrent utility power interruptions across multiple availability zones. Backup generators activated as designed; however, during the transition to sustained generator operation, a subset of generator systems were unable to fully synchronize under the sudden facility load, while others subsequently shut down due to thermal protection mechanisms as cooling systems were impacted by the broader power disruption. These events occurred across multiple facilities within the same timeframe, thus resulting in multiple zones being impacted concurrently. Datacenter utility power has been fully restored, and network infrastructure is fully operational. HVAC systems have returned to normal operation, and ambient temperatures are within expected ranges. The majority of storage infrastructure has recovered successfully, with two storage stamps currently completing final recovery and integrity validation activities. On-site teams continue to perform validation and bring the remaining resources online. Services dependent on these storage stamps are expected to recover as these final activities are completed. The following services have been confirmed restored and are operating normally: Service Bus, App Service (Web Apps), Azure Site Recovery, Backup (MAB), Azure Cosmos DB, Azure Resource Manager, Data Explorer, Azure IoT Hub, Microsoft Defender for Cloud Apps, Azure Container Registry, Azure Policy, Azure NetApp Files, Azure Resource Graph, Azure Data Factory, and Azure Synapse Estimated Time to Resolution: With datacenter power back online, our network infrastructure fully restored and with majority of storage back online, remaining recovery is limited to two storage stamps completing the final stages of recovery and data integrity checks. We estimate full service restoration within approximately 3–4 hours. Individual services dependent on these stamps will come back online progressively as validation completes. Customer Guidance: If you have resources in paired or alternate regions, consider failing over traffic away from West US 2 until we confirm full recovery. Single-region workloads will recover automatically as infrastructure is restored — no customer action is required. We recommend pausing new deployments to West US 2 until this incident is resolved; use alternate regions if urgent. Monitor Azure Status (https://azure.status.microsoft/en-us/status) and Service Health in the portal for real-time updates specific to your subscriptions. Next update: We will provide our next status update within the next 60 minutes, or sooner if events warrant.

  4. investigating · 29 May 2026, 18:32

    Impact Statement: Starting at 04:27 UTC on 29 May 2026, a severe thunderstorm caused widespread utility power loss to our West US 2 datacenter facilities, resulting in a multi-service outage. Impact is currently region-wide, and we have not yet confirmed isolation to specific scale units or availability zones and are treating this as full-region impact until physical inspections confirm otherwise. Customers are experiencing service connectivity failures, timeouts, and elevated error rates across affected services. This includes an inability to deploy new resources or scale existing workloads, intermittent availability where some requests may succeed while others fail depending on which infrastructure nodes have recovered, and both data plane and control plane impact for services not yet restored. Affected Azure services include, but are not limited to: Azure Functions, Azure Database for MySQL Flexible Servers, Azure Database for PostgreSQL Flexible Server, Azure Databricks, Redis, Azure SQL, Azure Managed Grafana, Virtual Machines, Virtual Machine Scale Sets, Azure Kubernetes Service, Storage, Application Insights, Azure Data Factory, Azure Monitor, and Azure Log Analytics Current Status: Severe thunderstorms resulted to a complete utility power loss to multiple datacenter buildings simultaneously. Our backup generators activated but were unable to fully compensate. The generator transfer switchover was only partially successful, with some generator sets failing to synchronize under the sudden full facility load and others experiencing thermal protection shutdowns as ambient temperatures rose due to simultaneous cooling system failures. This cascading combination of power instability and cooling loss exceeded our designed N+1 redundancy for this failure mode. We are still determining the full extent of mechanical vs. electrical failures in the generator systems. HVAC systems are being restarted in sequence; ambient temperatures are returning to safe ranges but have not fully normalized, so we are staged-powering equipment to avoid thermal re-trips. Network devices and storage arrays are being power-cycled and validated in sequence, with storage dependencies as the primary remaining bottleneck due to required manual intervention and data integrity verification. Where possible, traffic is being redirected to healthy nodes, and on-site teams are conducting physical hardware inspections to identify components requiring replacement. The following services have been confirmed restored and are operating normally: Service Bus, App Service (Web Apps), Azure Site Recovery, Backup (MAB), Azure Cosmos DB, Azure Resource Manager, Data Explorer, Azure IoT Hub, Microsoft Defender for Cloud Apps, Azure Container Registry, Azure Policy, Azure NetApp Files, Azure Resource Graph, Azure Synapse Customer Guidance: • If you have resources in paired or alternate regions, consider failing over traffic away from West US 2 until we confirm full recovery. • Single-region workloads will recover automatically as infrastructure is restored — no customer action is required. • We recommend pausing new deployments to West US 2 until this incident is resolved; use alternate regions if urgent. • Monitor Azure Status (https://azure.status.microsoft/en-us/status) and Service Health in the portal for real-time updates specific to your subscriptions. Next update: We will provide our next status update within the next 60 minutes, or sooner, if events warrant.

  5. investigating · 29 May 2026, 18:30

    Impact Statement: Starting at 04:27 UTC on 29 May 2026, a severe thunderstorm caused widespread utility power loss to our West US 2 datacenter facilities, resulting in a multi-service outage. Impact is currently region-wide, and we have not yet confirmed isolation to specific scale units or availability zones and are treating this as full-region impact until physical inspections confirm otherwise. Customers are experiencing service connectivity failures, timeouts, and elevated error rates across affected services. This includes an inability to deploy new resources or scale existing workloads, intermittent availability where some requests may succeed while others fail depending on which infrastructure nodes have recovered, and both data plane and control plane impact for services not yet restored. Affected Azure services include, but are not limited to: Azure Functions, Azure Database for MySQL Flexible Servers, Azure Database for PostgreSQL Flexible Server, Azure Databricks, Redis, Azure SQL, Azure Managed Grafana, Virtual Machines, Virtual Machine Scale Sets, Azure Kubernetes Service, Storage, Application Insights, Azure Data Factory, Azure Monitor, and Azure Log Analytics Current Status: Severe thunderstorms resulted to a complete utility power loss to multiple datacenter buildings simultaneously. Our backup generators activated but were unable to fully compensate. The generator transfer switchover was only partially successful, with some generator sets failing to synchronize under the sudden full facility load and others experiencing thermal protection shutdowns as ambient temperatures rose due to simultaneous cooling system failures. This cascading combination of power instability and cooling loss exceeded our designed N+1 redundancy for this failure mode. We are still determining the full extent of mechanical vs. electrical failures in the generator systems. HVAC systems are being restarted in sequence; ambient temperatures are returning to safe ranges but have not fully normalized, so we are staged-powering equipment to avoid thermal re-trips. Network devices and storage arrays are being power-cycled and validated in sequence, with storage dependencies as the primary remaining bottleneck due to required manual intervention and data integrity verification. Where possible, traffic is being redirected to healthy nodes, and on-site teams are conducting physical hardware inspections to identify components requiring replacement. The following services have been confirmed restored and are operating normally: Service Bus, App Service (Web Apps), Azure Site Recovery, Backup (MAB), Azure Cosmos DB, Azure Resource Manager, Data Explorer, Azure IoT Hub, Microsoft Defender for Cloud Apps, Azure Container Registry, Azure Policy, Azure NetApp Files, Azure Resource Graph, Azure Synapse Customer Guidance: • If you have resources in paired or alternate regions, consider failing over traffic away from West US 2 until we confirm full recovery. • Single-region workloads will recover automatically as infrastructure is restored — no customer action is required. • We recommend pausing new deployments to West US 2 until this incident is resolved; use alternate regions if urgent. • Monitor Azure Status (status.azure.com) and Service Health in the portal for real-time updates specific to your subscriptions. Next update: We will provide our next status update within 1 hour, or sooner, if events warrant.