We've confirmed that all systems are back to normal with no customer impact as of 3/12, 2:00 UTC. Our logs show the incident started on 3/10, 21:23 UTC and that during the 28 hours and 37 minutes that it took to resolve the issue customers experienced latent results for availability tests.
Root Cause: The failure was due to an invalid data value introduced to the backend causing intermittent delays of test results and possible missed results.
Incident Timeline: 28 Hours & 37 minutes - 3/10, 21:23 UTC through 3/12, 2:00 UTC
We understand that customers rely on Application Insights as a critical service and apologize for any impact this incident caused.
Initial Update: Wednesday, 11 March 2020 20:14 UTC
We are aware of issues within Application Insights and are actively investigating. Some customers may experience Data Latency in availability test results.
Next Update: Before 03/11 22:30 UTC
We are working hard to resolve this issue and apologize for any inconvenience. -Jeff