We've confirmed that all systems are back to normal with no customer impact as of 09/28, 22:00 UTC.Our logs show the incident started on 09/24, 20:00 UTC and that during the 4 days and 2 hours that it took to resolve the issue customers experienced issues with charts not being loaded for dynamic alerts based on metrics.
-
Root Cause: The failure was identified to be caused as part of recent deployment to our service.
-
Incident Timeline: 4 Days & 2 Hours - 09/24, 20:00 UTC through 09/28, 22:00 UTC
We understand that customers rely on Metric Alerts as a critical service and apologize for any impact this incident caused.
-Jayadev