Final Update: Tuesday, 28 June 2022 03:53 UTC
We've confirmed that all systems are back to normal with no customer impact as of 06/28, 03:33 UTC. Our logs show the incident started on 06/27, 21:20 UTC and during the 6 hours and 13 minutes that it took to resolve the issue. Some customers using Application Insights components in West US2 who may have experienced intermittent data latency and incorrect alert activation.
- Root Cause: We determined that some instances of a backend service Application Insights relies on became unhealthy. This resulted in the latency and missed alerts customers may have experienced.
- Mitigation: We restarted the backend services to a healthy state that mitigating the issue.
- Incident Timeline: 6 Hours & 13 minutes - 06/27, 21:20 UTC through 06/28, 03:33 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Vignesh