Final Update: Wednesday, 12 December 2018 15:28 UTC
We've confirmed that all systems are back to normal with no customer impact as of 12/12, 15:00 UTC. Our logs show the incident started on 12/12, 08:30 UTC and that during the 6 hours and 30 minutes that it took to resolve the issue a very small subset of customers who have configured their web tests in West US region may have experienced alerting failures.
-
Root Cause: The failure was due to performance degradation in one of our backend services.
-
Incident Timeline: 6 Hours & 30 minutes - 12/12, 08:30 UTC through 12/12, 15:00 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Varun