Final Update: Tuesday, 19 January 2021 18:57 UTC
We've confirmed that all systems are back to normal with no customer impact as of 1/19, 17:53 UTC. Our logs show the incident started on 1/19, 17:26 UTC and that during the 27 minutes that it took to resolve the issue impacted customers experienced ingested data latency and possible alerts misfiring due to latent data.
- Root Cause: The failure was due to a large unexpected ingress of data to a scale unit.
- Incident Timeline: 27 minutes - 1/19, 17:26 UTC through 1/19, 17:53 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Jeff