Final Update: Friday, 01 April 2022 22:13 UTC
We've confirmed that all systems are back to normal with no customer impact at the moment. Our logs show the incident started on 4/1, 20:25 UTC and that during the 35m that it took to resolve the issue some of the customers would have experienced data access, query failures and alerting issues. At present all of our services are running as expected.
- Root Cause: The failure was due to one of the machines going into unhealthy state and unable to serve the requests.
- Incident Timeline: 35 minutes - 4/1, 20:25 UTC through 4/1, 21:00 UTC
We understand that customers rely on Azure Log Analytics as a critical service and apologize for any impact this incident caused.
-Arvind Yadav