Final Update: Tuesday, 27 April 2021 00:04 UTC
We've confirmed that all systems are back to normal with no customer impact as of 04/26, 23:20 UTC. Our logs show the incident started on 04/26, 20:20 UTC and that during the 3 hours it took to resolve the issue, some customers using Application Insights in the West US 2 region may have experienced intermittent metric data gaps and incorrect alert activation. Customers using Log Analytics workspace-based Application Insights resources may have experienced log data gaps and incorrect alert activation. Additionally, customers using Custom Metrics (Preview) may have experienced data gaps and incorrect alert activation.
- Root Cause: A storage account used by a dependent backend service that processes metric data became unhealthy and unavailable.
- Incident Timeline: 3 Hours - 04/26, 20:20 UTC through 04/26, 23:20 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
-Jayadev
Update: Monday, 26 April 2021 22:42 UTC
We have isolated the root cause of the metric data loss in Application Insights. A storage account belonging to the dependent service that processes this data became unhealthy and could not process any data. We are recreating this storage account, and once it is validated, this issue should be mitigated.
- Work Around: none
- Next Update: Before 04/27 01:00 UTC
-Ian
Initial Update: Monday, 26 April 2021 20:56 UTC
We are aware of issues within Application Insights and are actively investigating. Some customers in West US 2 may experience loss of metric data, which may cause alerts to fail or misfire.
- Work Around: none
- Next Update: Before 04/26 23:00 UTC
We are working hard to resolve this issue and apologize for any inconvenience.
-Ian