Using Azure Site Recovery & Microsoft Defender for Servers to securely failover to malware-free VMs
Published Nov 29 2023 12:05 PM 2,858 Views

The last few years have witnessed an increase in the number of ransomware attacks aimed at disrupting businesses to extract a ‘ransom’ from the victims. As a result, organizations have employed various measures to ensure their data is well protected from any such attacks and there are ways to recover effectively. Business Continuity and Disaster Recovery (BCDR) forms an important part of the overall ransomware & malware protection strategy to minimize data loss and allow affected systems to recover as quickly as possible. We had earlier released a solution that demonstrates integration of Azure Backup with Microsoft Defender for Cloud for detection and response to alerts to accelerate response.

In this article, we will see how Azure Site Recovery offers an automated way to help you ensure that all your DR data, to which you would fail over, is safe and free of any malware using Microsoft Defender for Cloud.

Azure Site Recovery helps ensure business continuity by keeping business apps and workloads running during outages. Site Recovery replicates workloads running on physical and virtual machines (VMs) from a primary site to a secondary location. After the primary location is running again, you can fail back to it. Azure Site Recovery provides Recovery Plans to impose order, and automate the actions needed at each step, using Azure Automation runbooks for failover to Azure, or scripts.

Microsoft Defender for Cloud is a cloud-native application protection platform (CNAPP) with a set of security measures and practices designed to protect cloud-based applications from various cyber threats and vulnerabilities. 

 

Solution Details

In this solution, an Azure Site Recovery (ASR) recovery plan is utilized to execute a at the time of failover to automatically initiate Microsoft Defender on the failed-over virtual machines. Microsoft Defender then scans the new virtual machine, which is created as a result of the failover, to ensure that it is free of malware. In case of issues like malware being detected in the newly failed over VM, an alert is created in Defender for further actions.

This solution also provides an optional mitigation which can help you to automatically fail over to an on older recovery point till a malware-free failed-over VM is achieved. Any malware infected (failed-over) virtual machines that are created in the process are also automatically deleted.

This solution can be used for Azure to Azure (A2A) or VMWare to Azure (V2A) scenarios.

UtsavRaghuvanshi_0-1701287821063.png

 

How it works

Pre-requisites:

 

For virtual machines protected using ASR, follow the steps mentioned below to recover your data from a recovery point which is free of malware.

  1. Create an Azure Automation account.
  2. In your automation account, create two automation runbook for executing the detection scheduler script and the ransomware detection script. You can download these scripts from this location on GitHub-
    1. Runbook for the detection scheduler script: Use the script SchedulerForRansomwareDetection.ps1 from Github to create the first runbook. This script installs Defender on all virtual machines in the subscription of the failed over VM. It also creates a schedule for the RansomwareDetection runbook (discussed below).
    2. Runbook for the ransomware detection script: Download the script RansomwareDetector.ps1 from Github for using in the second runbook. This script, whenever executed, checks if any alerts are created for the failed over VM in Microsoft Defender for Cloud. This runbook will run as per the schedule created by the scheduler script discussed above. You can also modify certain parameters like changing the frequency of the scan through the automation account. Moreover, if opted for, it also allows you to automatically delete the failed over VM and initiate a failover to a previous point in time.

UtsavRaghuvanshi_1-1701287821082.png

 

  1. In the automation account, define the following variables under the ‘Variables’ item on the left navigation pane:
    1. VaultName: This is the name of the of the Recovery Services vault to which the virtual machines are protected with ASR.
    2. VaultResourceGroup: This is the Resource Group of the of the Recovery Services vault to which the virtual machines are protected with ASR.
    3. VaultSubscription: This is the Subscription of the of the Recovery Services vault to which the virtual machines are protected with ASR. Provide the ID of the subscription only.
    4. ChangePit Choose if you wish to enable automatic failover to an older point in time for recovery if any high-severity alert (this condition can be customized in the RansomwareDetector script) is detected in the current failed-over virtual machine (the current failed-over machine will be deleted). Enter ‘True’ if you want to enable automatic failover to older recovery points, else choose ‘False’.

UtsavRaghuvanshi_7-1701288154068.png

 

  1. The Azure Automation account’s identities must have owner permissions on the subscription where the failed-over VMs are getting created. To do this, you can either enable the system-assigned managed identity for the account or associate a user-assigned managed identity with the account.

UtsavRaghuvanshi_3-1701287821097.png

 

 

  1. Create an ASR recovery plan, and add a post action that would run the detection scheduler runbook (for the script SchedulerForRansomwareDetection.ps1) for the virtual machine after failover. Please note that the detection script does not need to be added as a post-action and will be executed from the scheduler script.

UtsavRaghuvanshi_4-1701287821104.png

 

  1. Trigger the failover operation in ASR, which creates the failed-over virtual machine in the secondary environment in Azure.

  2. Once the failover is complete, the postscript runs automatically and sets up policies and configurations that install and set up Microsoft Defender on the new virtual machine. As part of the process, agents required for running Defender are also installed.

Note: It is important to note that a key step of this process is to enable auto-provisioning of Defender, which enables Defender for all virtual machines in the subscription and scans them for malware.

  1. You would need to check Microsoft Defender for Cloud for any alerts that get created for this VM if any malware is detected.

UtsavRaghuvanshi_5-1701287821113.png

  1. If you have opted for the script to choose older recovery points in case of high severity alerts (or custom alert conditions defined in the RansomwareDetector script) in the failed over VM, the script will start identifying the replicated item for which this alert was raised. It will automatically performs a 'Change PIT' operation to failover to a previous recovery point that created before the detection time of the security alert. The default duration for this is 1-day older recovery points, however, you can configure it as per your needs.

  2. The newly created VM will once again be scanned and checked for any ransomware and the script will continue to iterate until we find a secure recovery point and create a malware-free VM.

 

Co-Authors
Version history
Last update:
‎Nov 29 2023 12:05 PM
Updated by: