Final Update: Friday, 24 January 2020 07:36 UTC
We've confirmed that all systems are back to normal with no customer impact as of 01/24, 07:20 UTC. Our logs show the incident started on 01/23, 23:55 UTC and that during the 7 hours and 25 minutes that it took to resolve the issue ~80% of customers in West US2 may have experienced issues while querying data, delayed or missing alerts.
- Root Cause: The failure was due to one of the backend service getting overloaded. The backend service was unable to scale out due to ongoing OS Patching.
- Incident Timeline: 7 Hours & 25 minutes - 01/23, 23:55 UTC through 01/24, 07:20 UTC
We understand that customers rely on Azure Log Analytics as a critical service and apologize for any impact this incident caused.
-Anmol