Final Update: Wednesday, 02 October 2019 17:11 UTC
We've confirmed that all systems are back to normal with no customer impact as of 10/2, 16:40 UTC. Our logs show the incident started on 10/2, 01:45 UTC and that during that time customers would have experienced intermittent missing of configured metric alerts.
-
Root Cause: The failure was due to performance bug in a backend system.
-
Incident Timeline: 14 Hours & 55 minutes
We understand that customers rely on Metric Alerts as a critical service and apologize for any impact this incident caused.
-Jeff