Final Update: Tuesday, 18 August 2020 00:19 UTC
We've confirmed that all systems are back to normal with no customer impact as of 8/18, 00:10 UTC. Our logs show the incident started on 8/17, 20:30 UTC and that during the 3 hours and 40 minutes that it took to resolve the issue 5,873 subscriptions experienced intermittent data latency, data gaps and incorrect alert activation.
- Root Cause: The failure was due to a dependent backend component reaching an operational threshold. A manual scaling of the resource was performed to mitigate the issue.
- Incident Timeline: 3 Hours & 40 minutes - 8/17, 20:30 UTC through 8/18, 00:10 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Jeff