Final Update: Tuesday, 20 October 2020 17:17 UTC
We've confirmed that all systems are back to normal as of 10/20, 16:00 UTC. Our logs show the incident started on 10/15, 15:00 UTC and that during the time it took to resolve the issue customers experienced latency up to 14 hours when performing service management operations - such as create, update, delete – for Availability tests.
- Root Cause: The failure was due to recent configuration changes that caused some instances of a backend service to reach operational threshold.
- Incident Timeline: 10/15, 15:00 UTC through 10/20, 16:00 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Chandar