Final Update: Friday, 17 January 2020 16:57 UTC
We've confirmed that all systems are back to normal with no customer impact as of 1/17, 14:33 UTC. Our logs show the incident started on 01/11 08:20 UTC and that during the 6 days and 6 hours that it took to resolve the issue all customers who attempted to access the page containing metrics experienced an authentication issue.
- Root Cause: The failure was due to a code regression in a recent deployment.
- Incident Timeline: 6 Days 6 Hours & 13 minutes - 1/11, 08:20 UTC through 1/17, 14:33 UTC
We understand that customers rely on Metric Alerts as a critical service and apologize for any impact this incident caused.
-Jeff