Final Update: Tuesday, 02 June 2020 01:45 UTC
We've confirmed that all systems are back to normal with no customer impact as of 6/2, 01:41 UTC. Our logs show the incident started on 6/2, 00:00 UTC and that during the one hour 41 minutes that it took to resolve the issue 9,500 customers experienced failed queries and delayed or missing alerts.
- Root Cause: The failure was due to a backend dependency that was in a bad state.
- Incident Timeline: 1 Hours & 41 minutes - 6/2, 00:00 UTC through 6/2, 01:41 UTC
We understand that customers rely on Azure Log Analytics as a critical service and apologize for any impact this incident caused.
-Jeff
Update: Tuesday, 02 June 2020 01:22 UTC
Root cause has been isolated to a backend service impacting issue which was impacting a users ability to query and causing misfiring alerts in East US, East US 2, Southeast Australia, North EU, and East Japan. To address this issue we have reset the backend service.
- Work Around: None
- Next Update: Before 06/02 03:30 UTC
-Jeff
Initial Update: Tuesday, 02 June 2020 00:39 UTC
We are aware of issues within Log Analytics and are actively investigating. Some customers in East Japan may experience failures when querying for data from their workspace.
- Work Around: None
- Next Update: Before 06/02 03:00 UTC
We are working hard to resolve this issue and apologize for any inconvenience.
-Jeff