Final Update: Sunday, 27 January 2019 11:10 UTC
We've confirmed that all systems are back to normal with no customer impact as of 1/27, 08:59 UTC. Our logs show the incident started on 1/26, 23:15 UTC and that during the approximately 9 hours 44 minutes that it took to resolve the issue, all customers in Southeast Asia and West US, and ~25% customers in East US will not be able to see (public preview) custom metrics between the start and end times of the incident.
-
Root Cause: The failure was due to a backend cluster becoming unhealthy.
-
Incident Timeline: 9 Hours & 44 minute - 1/26, 23:15 UTC through 1/27, 08:59 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Anmol