Limiting scans being captured every time Data Factory pipeline is executed

%3CLINGO-SUB%20id%3D%22lingo-sub-2118685%22%20slang%3D%22en-US%22%3ELimiting%20scans%20being%20captured%20every%20time%20Data%20Factory%20pipeline%20is%20executed%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2118685%22%20slang%3D%22en-US%22%3E%3CP%3EHi%2C%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3CP%3EHow%20can%20you%20limit%20the%20scans%20that%20are%20done%20on%20the%20execution%20of%20a%20Data%20Factory%20pipeline%3F%3C%2FP%3E%3CP%3ECurrently%20it%20seems%26nbsp%3B%20you%20cannot%20select%20how%20frequently%2C%20the%20scan%20executes%20every%20time%20a%20pipeline%20is%20executed.%3C%2FP%3E%3CP%3EWe%20would%20want%20the%20functionality%20to%20limit%20this%20when%20there%20have%20been%20no%20significant%20design%20changes.%3C%2FP%3E%3CP%3E%26nbsp%3B%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-SUB%20id%3D%22lingo-sub-2119358%22%20slang%3D%22en-US%22%3ERe%3A%20Limiting%20scans%20being%20captured%20every%20time%20Data%20Factory%20pipeline%20is%20executed%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2119358%22%20slang%3D%22en-US%22%3E%3CP%3E%3CA%20href%3D%22https%3A%2F%2Ftechcommunity.microsoft.com%2Ft5%2Fuser%2Fviewprofilepage%2Fuser-id%2F664171%22%20target%3D%22_blank%22%3E%40DebbieH%3C%2FA%3E%26nbsp%3BThe%20feature%20is%20to%20get%20operational%20information%20at%20the%20time%20of%20factory%20runtime.%20Such%20as%20status%2C%20rows%20impacted%20and%20much%20more%20data%20quality%20aspects%20in%20the%20future.%20We%20cant%20get%20such%20operational%20info%20at%20the%20definition%20time.%20May%20I%20know%20the%20requirement%20to%20limit%20the%20lineage%20capture%3F%3C%2FP%3E%3C%2FLINGO-BODY%3E%3CLINGO-SUB%20id%3D%22lingo-sub-2138047%22%20slang%3D%22en-US%22%3ERe%3A%20Limiting%20scans%20being%20captured%20every%20time%20Data%20Factory%20pipeline%20is%20executed%3C%2FLINGO-SUB%3E%3CLINGO-BODY%20id%3D%22lingo-body-2138047%22%20slang%3D%22en-US%22%3E%3CP%3EHi%20%3CA%20href%3D%22https%3A%2F%2Ftechcommunity.microsoft.com%2Ft5%2Fuser%2Fviewprofilepage%2Fuser-id%2F869147%22%20target%3D%22_blank%22%3E%40ChandruS%3C%2FA%3E%26nbsp%3B%2C%20if%20the%20cost%20to%20acquire%20the%20information%20is%20less%20than%20the%20value%20for%20decision%20making%20we%20may%20want%20to%20turn%20the%20feature%20off%20or%20limit%20how%20often%20it%20runs.%20For%20example%20the%20information%20captured%20on%20a%20weekly%20basis%20may%20be%20sufficient%20and%20limited%20benefit%20to%20getting%20the%20information%20at%20each%20execution.%3C%2FP%3E%3C%2FLINGO-BODY%3E
Contributor

Hi,

 

How can you limit the scans that are done on the execution of a Data Factory pipeline?

Currently it seems  you cannot select how frequently, the scan executes every time a pipeline is executed.

We would want the functionality to limit this when there have been no significant design changes.

 

3 Replies

@DebbieH The feature is to get operational information at the time of factory runtime. Such as status, rows impacted and much more data quality aspects in the future. We cant get such operational info at the definition time. May I know the requirement to limit the lineage capture?

Hi @ChandruS , if the cost to acquire the information is less than the value for decision making we may want to turn the feature off or limit how often it runs. For example the information captured on a weekly basis may be sufficient and limited benefit to getting the information at each execution.

Thanks for checking. At this point in Azure preview, we don't have any implications on cost for tracking the lineage once or every time the pipeline runs. We will keep in mind though at any point if this becomes a cost factor.