Final Update: Wednesday, 17 February 2021 00:40 UTC
We've confirmed that all systems are back to normal with no customer impact as of 2/17, 00:20 UTC. Our logs show the incident started on 2/16, 19:45 UTC and that during the ~4 hours 25 min hours that it took to resolve the issue, customers in East US2 experienced data gaps and incorrect alert activation.
- Root Cause: The failure was due to one of the backend services becoming unhealthy due to incorrect configuration which was impacting telemetry ingestion into Log analytics workspaces.
- Incident Timeline: 4 Hours & 25 minutes - 2/16, 19:45 UTC through 2/17, 00:20 UTC
We understand that customers rely on Azure Log Analytics as a critical service and apologize for any impact this incident caused.
-Anupama