Final Update: Monday, 13 July 2020 01:08 UTC
We've confirmed that all systems are back to normal with no customer impact as of 07/13, 00:55 UTC. Our logs show the incident started on 07/12, 20:50 UTC and that during the ~4 hours that it took to resolve the issue customers in West US experienced failed or misfired alerts.
-
Root Cause: The failure was due to unhealthy backend service.
-
Incident Timeline: 4 Hours & 5 minutes - 07/12, 20:50 UTC through 07/13, 00:55 UTC
We understand that customers rely on Log Search Alerts as a critical service and apologize for any impact this incident caused.
-Anupama