Measure and Respond : Running a high availability service requires world class measurement and monitoring capabilities. We continually measure, analyze and report on key service health metrics and success criteria for each of the AAD services. We develop and tune monitoring and metrics for each scenario both within each AAD service and across services that allow us to be sure it is working and if not, to take rapid action to recover. The most important metric we track is how quickly we can detect and mitigate a customer or live site issue. We heavily invest in monitoring and alerts to minimize time to detect (TTD Target: <5 mins) and operational readiness to minimize time to mitigate (TTM Target: <30 mins)
Secure operations: Azure AD is compliant with ISO 27001 and FISMA standards. We employ operational controls such as 2 factor authentication for any operation, and audit all operations. In addition we use a just in time elevation system to grant necessary access for any operational task on demand on a temporary basis.Summary: This combination of a well-planned, geo-distributed architecture with extensive monitoring and automated rerouting, failover and recovery enables us to deliver enterprise level availability and performance to customers in >50 countries all around the world. I hope this was useful and interesting! If you have questions or feedback, please let us know! Regards, Anandhi
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.