ADF adds TTL to Azure IR to reduce Data Flow activity times

Microsoft

Sep 27, 2019

ADF has added a TTL (time-to-live) option to the Data Flow properties of the Azure Integration Runtime (Azure IR) to reduce Data Flow activity times.

This setting is only used during ADF pipeline executions of Data Flow activities. Debug executions from pipelines and data preview debugging will continue to use the debug settings, which have a preset TTL of 60 minutes.

If you leave the TTL at 0, ADF will always spawn a new Spark cluster environment for every Data Flow activity that executes. This means that an Azure Databricks cluster is provisioned each time, which takes about 5-7 minutes to become available and execute your job.

However, if you set a TTL, ADF will maintain a pool of VMs that can be used to spin up each subsequent data flow activity against that same Azure IR. This reduces the amount of time needed to start up the environment before your job is executed.
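For readers who manage the Azure IR as JSON rather than through the UI, here is a minimal sketch of what an Azure IR definition with a Data Flow TTL might look like, built as a Python dictionary. The property names (`dataFlowProperties`, `computeType`, `coreCount`, `timeToLive`) reflect the managed integration runtime JSON schema as I understand it, and the helper name, IR name, and default values are hypothetical, so verify everything against the current ADF documentation before using it.

```python
import json


def build_azure_ir_definition(name, ttl_minutes, core_count=8, compute_type="General"):
    """Return a dict shaped like a managed Azure IR definition.

    Assumption: the property names below follow the ADF integration
    runtime JSON schema; the default values are illustrative only.
    """
    if ttl_minutes < 0:
        raise ValueError("ttl_minutes must be zero or positive")
    return {
        "name": name,
        "properties": {
            "type": "Managed",
            "typeProperties": {
                "computeProperties": {
                    "location": "AutoResolve",
                    "dataFlowProperties": {
                        "computeType": compute_type,
                        "coreCount": core_count,
                        # TTL in minutes; 0 means a new cluster for every activity
                        "timeToLive": ttl_minutes,
                    },
                }
            },
        },
    }


if __name__ == "__main__":
    # "DataFlowIRWithTTL" is a hypothetical IR name
    definition = build_azure_ir_definition("DataFlowIRWithTTL", ttl_minutes=10)
    print(json.dumps(definition, indent=2))
```

Running the script prints the JSON payload, which you can compare with the JSON view of your own Azure IR in the ADF UI.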

ADF will maintain that pool for the TTL period after the last data flow pipeline activity executes. Note that this extends the billing period for a data flow by the length of your TTL. However, your data flow job execution time will decrease because the VMs from the compute pool are reused. The compute resources are not provisioned until your first data flow activity is executed using that Azure IR.
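To reason about this trade-off, the behavior described above can be sketched as a small model. This is a simplification for planning purposes, not ADF's actual scheduling or billing logic: it assumes sequential, non-overlapping activities on one Azure IR, and the function names and sample schedule are hypothetical.

```python
def classify_starts(runs, ttl_minutes):
    """Label each run as a "cold" or "warm" start.

    runs: list of (start_minute, end_minute) tuples, sorted by start,
    for sequential Data Flow activities on the same Azure IR.
    """
    labels = []
    pool_expires_at = None  # no pool exists until the first activity runs
    for start, end in runs:
        if ttl_minutes > 0 and pool_expires_at is not None and start <= pool_expires_at:
            labels.append("warm")  # pool is still alive, VMs are reused
        else:
            labels.append("cold")  # a new cluster has to be provisioned
        pool_expires_at = end + ttl_minutes
    return labels


def idle_pool_minutes(runs, ttl_minutes):
    """Minutes the pool is kept alive while no activity is running."""
    idle = 0
    for (_, end), (next_start, _) in zip(runs, runs[1:]):
        gap = max(0, next_start - end)
        idle += min(ttl_minutes, gap)
    if runs:
        idle += ttl_minutes  # pool is held for the TTL after the last run
    return idle


if __name__ == "__main__":
    schedule = [(0, 10), (15, 25), (60, 70)]  # hypothetical run windows, in minutes
    print(classify_starts(schedule, ttl_minutes=10))
    print(idle_pool_minutes(schedule, ttl_minutes=10))
```

With a 10-minute TTL, the second run in this sample schedule reuses the pool, while the third starts cold because it begins after the pool has expired. A TTL of 0 makes every start cold and adds no idle pool time.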

For more detail, see the Azure Integration Runtime documentation. The ADF Data Flow performance guide can also help you optimize your environment.

Updated Sep 27, 2019

Version 2.0

azure data factory

Mark Kromer

Microsoft

Joined August 14, 2018


Azure Data Factory Blog


21 Comments

Mark Kromer
Microsoft
Aug 26, 2021
Reasat It is coming to Synapse in CY21.
Reasat
Copper Contributor
Aug 23, 2021
Mark Kromer Any update on when TTL for Synapse Data Flows will be available?
Thanks.
Mark Kromer
Microsoft
Apr 14, 2021
Abhijeetuk Go to the Azure Integration Runtime under "Manage" in the ADF UI. Click on the "Data flow runtime" properties accordion at the bottom of the panel and set a TTL. Then choose "Quick re-use".
Abhijeetuk
Copper Contributor
Apr 14, 2021
Mark Kromer, I can't find this setting in ADF. Can you please help? I need to set a TTL for the cluster.
Mark Kromer
Microsoft
Apr 13, 2021
Reasat No ETA/timeline at this time.
Reasat
Copper Contributor
Apr 12, 2021
Mark Kromer Do you have a tentative timeline when TTL for Synapse Data Flows will be available? We need this to plan our production migration. Thanks in advance!
Abhijeetuk
Copper Contributor
Mar 20, 2021
Mark Kromer, I can't find this setting in ADF. Can you please help?
Timo
Copper Contributor
Mar 11, 2021
Mark Kromer I came across the same issue. Do you have a timeline when TTL for Synapse Data Flows will be available?
Mark Kromer
Microsoft
Mar 01, 2021
Reasat Synapse has not yet implemented TTL in the Azure IR feature. This is something that they are currently working on as a top priority.
Reasat
Copper Contributor
Mar 01, 2021
We are using Synapse.
