Major Cloud Providers – Monthly Outage Recap: September 2018

MS Azure

9/26/18 Multiple Services – Southeast Asia – Mitigated

Between 09:52 and 12:28 UTC on 26 Sep 2018, a subset of customers in Southeast Asia may have experienced latency or difficulties connecting to Virtual Machines and/or Cloud Service resources hosted in this region. A number of related services may also have experienced some downstream impact.

9/19/18 Log Analytics – Latency, Timeouts and Service Management Failures

Between 10:54 and 18:45 UTC on 19 Sep 2018, a subset of customers using Log Analytics and/or other downstream services may have experienced latency, timeouts or service management failures.

9/15/18 Virtual Machines – Metrics Unavailable

Between 06:30 UTC on 15 Sep 2018 and 22:35 UTC on 17 Sep 2018, a subset of customers may have experienced difficulties viewing Virtual Machines and/or other compute resource metrics.

9/5/18 RCA – Azure Active Directory – Multiple Regions

Between as early as 09:00 UTC on Sep 05 and as late as 05:50 UTC on Sep 10, a small subset of Azure Active Directory (AAD) customers may have experienced intermittent authentication failures when connecting to resources in the following regions: Japan, India, Australia, South Brazil, and East US 2.

DownDetector

9/5/18 Problems at Microsoft Azure

Microsoft Azure is having issues since 2:25 PM EDT. Most reported problems:

  • Cloud services (46%)
  • Virtual machines (26%)
  • Website hosting (26%)

9/4/18 Azure South Central US – Preliminary RCA

In the early morning of September 4, 2018, high-energy storms hit southern Texas in the vicinity of Microsoft Azure’s South Central US region. Multiple Azure datacenters in the region saw voltage sags and swells across the utility feeds. At 08:42 UTC, lightning caused electrical activity on the utility supply, which caused significant voltage swells.

Customer impact:
[1] Impact to resources in South Central US

[2] Impact to Azure Service Manager (ASM)

[3] Impact to Azure Active Directory (AAD)

[4] Impact to Visual Studio Team Services (VSTS)

[5] Impact to Azure Application Insights

[6] Impact to the Azure status page

[7] Impact to Azure subscription management

DownDetector

9/4/18 Problems at Microsoft Azure

Microsoft Azure is having issues since 6:22 PM EDT. Most reported problems:

  • Website hosting (46%)
  • Cloud services (40%)
  • Virtual machines (13%)

DownDetector

9/4/18 Problems at Microsoft Azure

Microsoft Azure is having issues since 8:21 AM EDT. Most reported problems:

  • Website hosting (41%)
  • Cloud services (38%)
  • Virtual machines (19%)

 

AWS

9/26/18 VMware Cloud on AWS – SDDC Create/Update/Delete operations failing

VMware Cloud on AWS is experiencing failures with SDDC create/update/delete operations. Impact: SDDC Create/Update/Delete operations are failing. We do not expect this issue to impact existing workloads in existing SDDCs.

9/20/18 VMware Cloud on AWS – Intermittent SDDC Create/Update/Delete operations failing

VMware Cloud on AWS is experiencing intermittent failures with SDDC create/update/delete operations. Impact: SDDC Create/Update/Delete operations are failing. We do not expect this issue to impact existing workloads in existing SDDCs.

9/10/18 VMware SDDC Create/Update/Delete operations failing on VMware Cloud on AWS

Start Time: September 10, 2018 02:18 PM UTC
End Time: September 10, 2018 08:07 PM UTC
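
For readers who want to compare incident lengths, the Start Time and End Time stamps used in this recap can be turned into a duration with a few lines of Python. This is only a minimal sketch: the helper names parse_utc and outage_minutes are hypothetical, and it assumes every stamp follows the exact "Month DD, YYYY HH:MM AM/PM UTC" layout shown above and that English month names are in effect.

```python
from datetime import datetime, timezone

# Layout of the "Start Time" / "End Time" stamps in this recap (assumed fixed).
STAMP_FORMAT = "%B %d, %Y %I:%M %p UTC"


def parse_utc(stamp: str) -> datetime:
    """Parse a stamp such as 'September 10, 2018 02:18 PM UTC' as an aware UTC datetime."""
    return datetime.strptime(stamp, STAMP_FORMAT).replace(tzinfo=timezone.utc)


def outage_minutes(start: str, end: str) -> int:
    """Return the length of an outage window in whole minutes."""
    delta = parse_utc(end) - parse_utc(start)
    return int(delta.total_seconds() // 60)


# The 9/10/18 VMware Cloud on AWS window listed above.
minutes = outage_minutes(
    "September 10, 2018 02:18 PM UTC",
    "September 10, 2018 08:07 PM UTC",
)
print(f"{minutes // 60} h {minutes % 60} min")  # prints "5 h 49 min"
```

The same helper would apply to the other entries in this recap that list explicit Start Time and End Time values; entries written as "Between ... and ... UTC" use a different layout and would need their own format string.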

 

Other Platforms

9/28/18 SAP Cloud Platform Europe (Netherlands) [cf-eu20] – Service Advisory

Between approximately 10:30 and 16:40 UTC on 28 Sep 2018, customers may have experienced a disruption on the Europe (Netherlands) region of the SAP Cloud Platform. Lifecycle management operations were slower than usual.

9/27/18 SAP Cloud Platform Multiple Regions – Service Advisory

Between approximately 12:14 and 15:48 UTC on 27 Sep 2018, customers may have experienced a disruption on the KSA (Riyadh) and Russia (Moscow) regions of the SAP Cloud Platform. Customers may have been unable to create new virtual machines.

9/26/18 SAP Cloud Platform Australia (Sydney) [neo-ap1] – Identity Authentication, accounts.sap.com – Service Advisory

Between approximately 09:03 and 13:10 UTC on 26 Sep 2018, customers may have experienced a disruption impacting authentication with the SAP ID Service and tenants of SAP Cloud Platform Identity Authentication based in Australia (Sydney).

9/25/18 SAP Cloud Platform US East (Sterling) [neo-us3] – Service Advisory

Between approximately 07:30 UTC on 21 Sep 2018 and 08:35 UTC on 25 Sep 2018, customers may have experienced high latency while connecting applications to databases on the US East (Sterling) region of the SAP Cloud Platform due to an infrastructure issue.

9/24/18 VMware Log Intelligence availability issue

The VMware Log Intelligence service is experiencing an availability issue. Impact: Users may be unable to access the service or may have trouble accessing it.

9/23/18 SAP Cloud Platform Europe (Rot) [neo-eu1] – Service Advisory

Between approximately 19:43 UTC on 22 Sep 2018 and 14:33 UTC on 23 Sep 2018, customers may have experienced a disruption on the Europe (Rot) [neo-eu1] region of the SAP Cloud Platform (hana.ondemand.com). Monitoring indicated a possible disruption impacting lifecycle operations such as starting applications. Applications could still be deployed, but once deployed they could not be started.

9/21/18 SAP Cloud Platform Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 08:00 and 14:40 UTC on 21 Sep 2018, customers may have experienced a disruption on the Europe (Rot) region of the SAP Cloud Platform.

Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 01:41 and 05:26 UTC on 21 Sep 2018, customers may have experienced a disruption on the Europe (Rot) region of the SAP Cloud Platform due to an infrastructure issue.

9/20/18 SAP Cloud Platform Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 12:56 and 16:20 UTC on 20 Sep 2018, customers may have experienced a disruption on the Europe (Rot) region of the SAP Cloud Platform. Customers may have been unable to create backing services during this time. Additionally, between approximately 14:05 and 14:40 UTC, customers may have been unable to access applications and services in the region.

9/17/18 VMware Cloud backend service Availability issue

Impact: Users may be unable to access the service or may have trouble logging in to it.

Start Time: September 17, 2018 08:55 AM UTC
End Time: September 17, 2018 10:10 AM UTC

9/17/18 SAP Cloud Platform US East (Sterling) [neo-us3] – Service Advisory

Between approximately 16:15 and 18:38 UTC on 17 Sep 2018, customers may have experienced a disruption on the US East (Sterling) region of the SAP Cloud Platform.

9/16/18 VMware Cloud backend service Availability issue

Start Time: September 16, 2018 02:00 PM UTC
End Time: September 16, 2018 03:30 PM UTC

9/14/18 Google Cloud Machine Learning Incident #18002

AutoML Natural Language failing to train models. Incident began at 2018-09-14 08:31 and ended at 2018-09-14 09:38.

9/14/18 Problems at Go Daddy

Go Daddy is having issues since 1:57 AM EDT.

Most reported problems:

  • Email (65%)
  • Domains (22%)
  • Hosting (12%)

9/13/18 Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 08:24 and 11:14 UTC on 13 Sep 2018, customers may have experienced a disruption on the Europe (Rot) region of the SAP Cloud Platform.

9/11/18 Google Cloud Networking Incident #18015

We are investigating a major issue with Google Cloud Networking in europe-north1-c. Incident began at 2018-09-11 02:04 and ended at 2018-09-11 02:54.

9/11/18 Google Cloud Functions Incident #18002

The Google Cloud Functions service is experiencing errors when updating functions via gcloud. Incident began at 2018-09-11 08:39 and ended at 2018-09-11 09:11.

9/11/18 SAP Cloud Platform

Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 15:55 and 21:25 UTC on 11 Sep 2018, customers may have experienced a disruption on the Europe (Rot) region of the SAP Cloud Platform. Customers may have been unable to perform lifecycle management operations.

US East (Ashburn) [neo-us1] – Service Advisory

Between approximately 08:26 and 08:35 UTC on 11 Sep 2018, customers may have experienced a disruption on the US East (Ashburn) region of the SAP Cloud Platform.

US East (Sterling) [neo-us3] – Service Advisory

Between approximately 08:26 and 08:35 UTC on 11 Sep 2018, customers may have experienced a disruption on the US East (Sterling) region of the SAP Cloud Platform.

Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 08:01 UTC on 10 Sep 2018 and 05:06 UTC on 11 Sep 2018, customers may have experienced a disruption on the Europe (Rot) region of the SAP Cloud Platform.

9/11/18 Problems at Go Daddy

Go Daddy is having issues since 3:21 PM EDT. Most reported problems:

  • Domains (45%)
  • Web tools (32%)
  • Hosting (22%)

9/10/18 SAP Cloud Platform Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 08:01 and 09:35 UTC on 10 Sep 2018, customers may have experienced a disruption on the Europe (Rot) [cf-eu1] region of the SAP Cloud Platform. Monitoring indicated that the disruption impacted lifecycle management operations; applications that were already deployed were not impacted.

9/9/18 SAP Cloud Platform US East (Ashburn) [neo-us1] – Service Advisory

Between approximately 08:03 and 13:42 UTC on 09 Sep 2018, customers may have experienced a disruption on the US East (Ashburn) region of the SAP Cloud Platform.

9/5/18 Google Cloud Console Incident #18005

We are experiencing an issue with Google Cloud Build for users trying to create triggers via the Cloud Console beginning at Wednesday, 2018-09-05 12:25 US/Pacific. Affected users may notice the Add trigger button is not responding. Incident began at 2018-09-05 12:25 and ended at 2018-09-06 14:28.

9/5/18 SAP Cloud Platform Europe (Frankfurt) [cf-eu10] – Service Advisory

Between approximately 13:06 and 15:56 UTC on 05 Sep 2018, customers may have experienced a disruption on the Europe (Frankfurt) region of the SAP Cloud Platform. Customers may have been unable to automate the scheduling of jobs.

9/5/18 Issue while Accessing Service from VMware’s cloud portal

We are currently experiencing issues when attempting to log in to the consoles of all VMware Cloud Services. Impact: Users may experience intermittent login failures. Existing workloads are not impacted. Sep 5, 08:14 UTC

9/4/18 IBM Cloud Functions – Issues with Cloud Functions user interface

Users of the Functions browser UI are experiencing intermittent behavior where the Actions, Triggers or Monitoring pages are blank. September 4, 2018 1:00 PM PDT

9/3/18 SAP Cloud Platform China (Shanghai) [neo-cn1] – Service Advisory

Between approximately 13:50 and 14:33 UTC on 03 Sep 2018, customers may have experienced a disruption on the China (Shanghai) [neo-cn1] region of the SAP Cloud Platform.

Europe (Rot) [cf-eu1] – Service Advisory

Between approximately 10:33 and 11:57 UTC on 03 Sep 2018, customers may have experienced a disruption on the Europe (Rot) [cf-eu1] region of the SAP Cloud Platform. Customers may have been unable to perform lifecycle management operations.

Europe (Amsterdam) [neo-eu3] – Service Advisory

Between approximately 06:00 and 07:00 UTC on 03 Sep 2018, customers may have experienced a disruption on the Europe (Amsterdam) [neo-eu3] region of the SAP Cloud Platform. Customers may have been unable to access the Web IDE.

9/4/18 Google Cloud Developer Tools Incident #18003

Increased error rate or high latency for Google Container Registry API calls. Incident began at 2018-09-04 04:22 (all times are US/Pacific).

9/4/18 Google Cloud Storage Incident #18003

Increased error rate for Google Cloud Storage. We are seeing intermittent errors for requests to Google Cloud Storage in the US region. Our Engineering Team is continuing mitigation work. Incident began at 2018-09-04 04:22 (all times are US/Pacific).

9/2/18 SAP Cloud Platform Europe (Netherlands) [cf-eu20] – Service Advisory

Between approximately 10:00 and 12:29 UTC on 02 Sep 2018, customers may have experienced a disruption on the Europe (Netherlands) [cf-eu20] region of the SAP Cloud Platform. Customers may have experienced issues while pushing applications to CF.

Multiple Regions – Service Advisory

Between approximately 02:08 UTC and 02:18 UTC on 02 Sep 2018, customers may have experienced a disruption on multiple regions of SAP CP due to a network issue.

US East (Sterling) [neo-us3] – Service Advisory

Between approximately 09:08 and 10:05 UTC on 02 Sep 2018, customers may have experienced a disruption on the US East (Sterling) [neo-us3] region of the SAP Cloud Platform. Customers may have been unable to access their SAP Cloud Platform Virtual Machines.