We've confirmed that all systems are back to normal with no customer impact as of 1/19, 17:53 UTC. Our logs show the incident started on 1/19, 17:26 UTC and that during the 27 minutes that it took to resolve the issue impacted customers experienced ingested data latency and possible alerts misfiring due to latent data.
Root Cause: The failure was due to a large unexpected ingress of data to a scale unit.
Incident Timeline: 27 minutes - 1/19, 17:26 UTC through 1/19, 17:53 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
Initial Update: Tuesday, 19 January 2021 18:32 UTC
We are aware of issues within Application Insights and are actively investigating. Some customers in East US region may experience delayed or missed Log Search Alerts and Data Latency.
Next Update: Before 01/19 20:00 UTC
We are working hard to resolve this issue and apologize for any inconvenience. -Jeff