Here's a quick review of the road so far:
CI CD in Azure Synapse Analytics Part 1
- Creating an Azure DevOps project
- Linking our Azure Synapse Analytics environment to that Project via Git
- Validating that our Azure DevOps Repo was populated with our Azure Synapse Analytics environment
CI CD in Azure Synapse Analytics Part 2:
- Create a new branch on our Repo
- Edit our Azure Synapse Analytics environment
- Specifically, my SQL scripts had demos all over the place, and Buck Woody said I had to clean up my very messy room .... er, Azure Synapse Analytics environment
- Create a Pull Request in Azure Synapse Analytics to merge our new branch with the main
- Approve the Pull Request in Azure DevOps
- Validate our main branch is updated in our Azure Synapse Analytics Environment
CI CD in Azure Synapse Analytics Part 3
- Create an Artifact pipeline
- This is to create an Artifact we can use to deploy to another environment
This time we will:
- Give our Azure DevOps Service Principal access to our Azure Synapse Workspace
- Validate or grant our Azure DevOps Service Principal the Storage Blob Data Contributor & User Access Administrator roles (*This is only needed if your storage account was provisioned before you created your Synapse Workspace, or if you connect your Dev, QA, and Prod to the same ADLS Gen2 storage account. If you create your Synapse Workspace and Storage Account from an ARM template deployed from DevOps, then your DevOps Service Principal will have Owner on the Storage Account, which gives the Service Principal User Access Administrator capabilities.)
- Create the release pipeline
- Validate the deployment
- *If you have Provisioned SQL Pools as part of your deployment, pause them after the deployment completes, because they will be created and running on deployment
Before we create our release pipeline, we need to make sure the Azure Service Principal account has the proper permissions. If it does not, you will get cryptic errors with a GUID and something about "does not have permissions to blah blah blah". Trust me, it is super annoying.
The two permissions we will need are located in two different places. The first is in our Azure Synapse Workspace, specifically using Azure Synapse Studio. The second is in the Storage Account for our Azure Data Lake Gen 2 that is the default ADLS connection for our Azure Synapse Studio.
First open your Azure Synapse Studio and navigate to the Management Blade.
Now Click on Access Control. The user that created the Azure Synapse Workspace is automatically given the role of Synapse Administrator, the second user with that role will be the Managed Identity for the Azure Synapse Workspace. We need to add our Azure DevOps Service Principal to this role.
*A quick note, I prefer to manage this with an Azure Active Directory group. In this blog I will show how to add the account directly to the Azure Synapse Workspace. However, the best practice would be to have an AAD group that is granted Synapse Administrator, and then add the Service Principal to that group.
Click the +Add link.
Now we will type in the Azure DevOps Project name. If your organization URL was https://bobsburgers.visualstudio.com and your project name was Azure Synapse Studio CI CD, you would type in bobsburgers-Azure Synapse Studio CI CD. This will show you the Service Principal GUID following that name.
Click the name as it appears and then click the Apply Button.
Now open a browser and navigate to the Azure Portal. In the search window at the top, type Storage Accounts. Select the storage account that you are using as the default ADLS storage account for your Azure Synapse Workspace. Click the Access Control (IAM) blade. Click +Add, then click Add role assignment.
Select Storage Blob Data Contributor for the Role. In the Select text box, type in the Azure DevOps Service Principal the same way we did for the Synapse Administrator role.
Repeat the previous steps, except this time specify the User Access Administrator Role.
Next we will navigate to our Azure DevOps Project. Select Pipelines, then Releases, then New pipeline.
Click on Empty Job. Then click on Add an artifact.
Ensure our project is selected. Select the name of the Build Pipeline that we created in our previous blog (Or whatever YOU wanted to name your Build Pipeline because my naming conventions do not define you!). Select the Latest Build, and click Add.
Rename the Release Pipeline to reflect what we are doing. We selected Deploy Dev Release. Click on the Stage 1 link that reads "1 job, 0 task".
Click the + plus sign on Agent Job. In the search text box type "Synapse", the Synapse workspace deployment task will appear if you have installed it from the Marketplace. If not, FEAR NOT! You should see a link for it below under the heading Marketplace. Click on it to install the task to your Azure DevOps project.
Once you have added the task, click on it. We will fill out the Template, Template parameters, Synapse Workspace connection type, and Synapse Workspace name; we will get to OverrideArmParameters in a moment, as that one takes a lot more detail.
First, click on the ... ellipsis button next to Template.
Navigate through the build pipeline, ASW_Drop, ARM, to the TemplateForWorkspace.json. Select the .json file and click OK.
Now repeat the same steps for the Template Parameters text box, this time selecting the TemplateParametersForWorkspace.json file.
Under Synapse workspace connection type, select the Azure Subscription that contains the environment where we are deploying our release. Specify the Resource Group and the Azure Synapse Workspace name.
Now we begin to focus on the override parameters. First we will travel back to our Repo and look at the TemplateForWorkspace.json. Any parameter that has the type "secureString" will need an override parameter. Depending on the level of development you have done, you may have many of these; in our example we have two: the default workspace connection to the Provisioned SQL Pools and a Linked Service I created to an Azure SQL Database.
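If you want to double-check which parameters need overrides, you can scan the template for them. Here is a minimal Python sketch; it inlines a trimmed, hypothetical sample of the template's "parameters" section (the names are this post's example names) rather than loading the full file:

```python
import json

# Hypothetical, trimmed sample of the "parameters" section of TemplateForWorkspace.json.
# In practice, load the real file instead:
#   template = json.load(open("TemplateForWorkspace.json"))
template = {
    "parameters": {
        "workspaceName": {"type": "string"},
        "bballasw-WorkspaceDefaultSqlServer_connectionString": {"type": "secureString"},
        "Lahman_connectionString": {"type": "secureString"},
    }
}

# Every parameter typed "secureString" needs an override in the release pipeline.
secure_params = [
    name
    for name, spec in template["parameters"].items()
    if spec.get("type", "").lower() == "securestring"
]
print(secure_params)
```

Each name this prints is one override you will add to the OverrideArmParameters text area later.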
Dear Reader, you are wondering where to find those. You are in luck! Navigate to your Azure Synapse Analytics Workspace, click on the Manage blade, then Linked services. Now click on the { } Code symbol next to the name of the linked service that has a type of secureString.
This will open a view of the JSON that contains the data we need. Copy the text between the double quotes. DO NOT SELECT THE DOUBLE QUOTES!! JUST THE STRING BETWEEN THE DOUBLE QUOTES!
Sorry for yelling, but we will use this string soon and the double quotes "" will cause it to fail.
Now let's do the same thing for the Azure SQL Database.
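If you would rather not hand-copy the value (and risk grabbing those quotes), you can parse the JSON and print just the string. A small Python sketch, assuming the linked-service JSON keeps the connection string under properties.typeProperties; the JSON below is a made-up example, and the exact shape may differ for your linked service type:

```python
import json

# Hypothetical linked-service JSON, as shown by the { } Code view in Synapse Studio.
# Structure assumed; check yours before relying on these property names.
linked_service = json.loads("""
{
  "name": "Lahman",
  "properties": {
    "type": "AzureSqlDatabase",
    "typeProperties": {
      "connectionString": "Data Source=myserver.database.windows.net;Initial Catalog=Lahman;"
    }
  }
}
""")

# json parsing strips the surrounding double quotes for us.
conn = linked_service["properties"]["typeProperties"]["connectionString"]
print(conn)
```

The printed value has no surrounding quotes, which is exactly what the pipeline variable needs.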
Now navigate back to our Azure DevOps Release pipeline. Click on Variables, then click the + Add button three times. We will be creating two variables based on the secureStrings in our JSON file. We will also be creating a system.debug variable to give us extra information in our release pipeline; its value is True.
After you paste in the secureStrings, click the lock button next to the two connection string variables; leave system.debug unencrypted.
Your pipeline should look similar to this.
Now we will go to the OverrideArmParameters text area. We will use the following syntax: -variableNameFromTheJsonFile $(devOpsPipelineVariable)
For example:
-bballasw-WorkspaceDefaultSqlServer_connectionString $(WorkspaceDefault) -Lahman_connectionString $(Lahman)
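If you have more than a couple of secureStrings, a tiny script can build that string for you. This is just a sketch using this post's example names; the mapping from template parameter to pipeline variable is yours to fill in:

```python
# Map each secureString parameter name from TemplateForWorkspace.json to the
# release-pipeline variable holding its value (example names from this post).
overrides = {
    "bballasw-WorkspaceDefaultSqlServer_connectionString": "WorkspaceDefault",
    "Lahman_connectionString": "Lahman",
}

# Build the value for the OverrideArmParameters text area:
# -parameterName $(pipelineVariable), space-separated.
override_arm_parameters = " ".join(
    f"-{param} $({var})" for param, var in overrides.items()
)
print(override_arm_parameters)
```

Paste the printed line straight into the OverrideArmParameters text area.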
Yours may vary based on your number of secureStrings and their names. Now let's click Save on our pipeline.
Make a comment and click OK
Now click Create release.
Click Create
Click Release-1 (or whatever your release number is).
After your Agent begins to process click on Logs and watch it run!
AND NOW!!!!! ......it failed.
A few times. But hey, it's not developing if there isn't a failure. So it's almost 1 am, and I *believe* I have it running so let me take this time to walk you through what I've found.
Spark Pools and Self-Hosted Integration Runtimes are not created by the release pipeline. If you have a Linked Service that uses a Self-Hosted Integration Runtime, you will need to manually create that in your QA or Prod environment prior to deployment.
If you are developing Notebooks and have them connected to a Spark Pool, you will need to recreate that Spark Pool in QA or Production. Notebooks that are linked to a Spark Pool that does not exist in an environment will fail to deploy.
Name them the same thing. Do not change names. Trust me.
If you are doing a deployment and your Provisioned SQL Pools are paused, the deployment will fail. *More to come on database migrations; a database project build and release is still needed.
Here's a quick image.
I'm on release 4, attempt 2. This appears to be running just fine for me.
VICTORY!!
Now let us go and check our QA Workspace! First up Scripts and Notebooks.
Excellent! Everything is there. Next let us look at our Provisioned SQL Pools.
Looks great! As a quick side note, the databases will be brought over at DW100c, so you can scale them up as needed. Also, if you are in a demo environment like me, be sure to pause them after the deployment completes. Next up, Pipelines.
I like this! Now let's check out linked services.
I don't like this. Here are my Dev links for my default workspace in my QA environment. Right now the only way I've found to clean this up is to use the Az.Synapse PowerShell Module. Navigate back to your release pipeline. Edit it, add an Azure PowerShell task. We will then use this script:
##Required for Azure DevOps initial deployment
Install-Module Az.Synapse -RequiredVersion 0.2.0 -Scope CurrentUser -Force
#Get rid of the Dev linked services in QA (one line per linked service to remove)
Remove-AzSynapseLinkedService -WorkspaceName yourWorkspaceName -Name firstLinkedServiceToRemove
Remove-AzSynapseLinkedService -WorkspaceName yourWorkspaceName -Name secondLinkedServiceToRemove
Under Azure PowerShell version options select Specify other version, set Preferred Azure PowerShell Version 3.1.0.
The next time you run your deployment, this should clean up those links.
All right Dear Reader, I'm off to sleep. Happy Monday and as always, thank you for stopping by.
Thanks,
Brad