We've confirmed that all systems are back to normal with no customer impact as of 09/27, 10:52 UTC. Our logs show the incident started on 09/27, 08:00 UTC and that during the 2 hours & 52 minutes that it took to resolve the issue some of customers experienced alerting failure.
Root Cause: The failure was due to deployment in backend service
Incident Timeline: 2 Hours & 52 minutes - 09/27, 08:00 UTC through 09/27, 10:52 UTC
We understand that customers rely on Metric Alerts as a critical service and apologize for any impact this incident caused.
Initial Update: Friday, 27 September 2019 10:33 UTC
We are aware of issues within Metric Alerts and are actively investigating. Some customers may experience Alerting failure.
Work Around: None
Next Update: Before 09/27 14:00 UTC
We are working hard to resolve this issue and apologize for any inconvenience. -Naresh