Final Update: Wednesday, 15 January 2020 18:33 UTC
We've confirmed that all systems are back to normal, with no customer impact as of 1/15, 18:05 UTC. Our logs show that the incident started on 1/15, 17:31 UTC, and that during the 34 minutes it took to resolve the issue, 4.3% of customers experienced failed or timed-out queries, temporary log ingestion delays, and alerts that may have misfired or failed to fire.
- Root Cause: The failure was due to a back-end component becoming unhealthy. The component was taken out of rotation, and component health was restored.
- Incident Timeline: 34 minutes - 1/15, 17:31 UTC through 1/15, 18:05 UTC
We understand that customers rely on Application Insights as a critical service, and we apologize for any impact this incident caused.
-Jeff