We've confirmed that all systems are back to normal with no customer impact as of 06/28, 03:33 UTC. Our logs show the incident started on 06/27, 21:20 UTC and during the 6 hours and 13 minutes that it took to resolve the issue. Some customers using Application Insights components in West US2 who may have experienced intermittent data latency and incorrect alert activation.
Root Cause: We determined that some instances of a backend service Application Insights relies on became unhealthy. This resulted in the latency and missed alerts customers may have experienced.
Mitigation: We restarted the backend services to a healthy state that mitigating the issue.
Incident Timeline: 6 Hours & 13 minutes - 06/27, 21:20 UTC through 06/28, 03:33 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
Initial Update: Tuesday, 28 June 2022 03:01 UTC
We are aware of issues within Application Insights and are actively investigating. Some customers may experience delayed or missed Log Search Alerts and Data Latency.
Work Around: None
Impact start time: 6/27/2022 21:30 UTC
Next Update: Before 06/28 04:30 UTC
We are working hard to resolve this issue and apologize for any inconvenience. -muhkhan