Blog Post

Azure Data Factory Blog
2 MIN READ

Announcing the Public Preview of a new top-level CDC resource in ADF

Noelle_Li's avatar
Noelle_Li
Icon for Microsoft rankMicrosoft
Jan 23, 2023

One of the most difficult aspects to building enterprise-ready data integration pipelines is capturing just the latest changed data from disparate sources quickly. Change Data Capture (CDC) is a set of technologies and techniques that enables data engineers to discover changed records as they occur at the source.

 

Azure Data Factory (ADF) has recently added many new CDC-enabled connectors to process change data from SQL, Storage, Cosmos DB, and many other sources. Much of the feedback that we received from our users about this has been centered around making it easy to configure and to continuously detect changes at the source.

 

We heard your feedback and are super excited to announce the public preview and immediate release of a new top-level ADF resource, found in your ADF resource explorer, that will automatically detect changes in your data source(s).

 

NOTE: This feature is being rolled out to Azure regions throughout this week.

 

 

With this new feature, you can quickly configure a continuous process to automatically read changes from your data source(s) without needing to design pipelines or data flows and triggers. Just point to your sources, tell ADF where you want to land the data, and click start. It’s that easy!

 

 

With the CDC resource, there is no need to learn ADF concepts like triggers, schedules, integration runtimes, and more. Simply follow a guided process to configure your sources, targets, and any required data transformation options. ADF will then auto-map your columns based on fuzzy matching.

 

The new CDC resource will also offer continuous data integration as well as frequency-based latencies. This makes budgeting super-easy as the cost of using a CDC resource is a 4-core General Purpose dataflow cluster that is only billed while your data is being processed based on the latencies you select.

 

 

Get started now with data processing by trying the new Change Data Capture resource in Azure Data Factory, now available in Public Preview. Find more information in our documentation and follow this guide to set up your new CDC resource.

 

We are always open for feedback so please let us know your thoughts in the comments below or add to our Ideas forum.

 

Updated Jan 23, 2023
Version 1.0

31 Comments

  • emherz's avatar
    emherz
    Copper Contributor

    Does this new top-level feature work with SAP? SAP is not mentioned as a Supported Data Source for CDC Resource. However, the SAP CDC connector is mentioned as a supported connector. 

     

    The documentation shows Supported Data Sources as Avro, Azure Cosmos DB (SQL API), Azure SQL Database, Delimited Text, JSON, ORC, Parquet, SQL Server, XML.
    https://learn.microsoft.com/en-us/azure/data-factory/concepts-change-data-capture-resource#supported-data-sources

     

    The documentation also shows the supported connectors as SAP CDC, Azure SQL Database, SQL Server, Azure SQL Managed Instance, Azure Cosmos DB (SQL API).

    https://learn.microsoft.com/en-us/azure/data-factory/concepts-change-data-capture#native-change-data-capture-in-mapping-data-flow

     

    • luisvexl's avatar
      luisvexl
      Copper Contributor

      2 year later is still in preview? Do we have a timeline for GA?

      • Noelle_Li's avatar
        Noelle_Li
        Icon for Microsoft rankMicrosoft

        Hi, this is still in preview. There is no timeline for GA at this time. We do have something similar in Fabric Data Factory called Copy Job that is now GA. 

  • Alexis_Ortega if the data store that you are connecting to as a source supports marking rows as deletes (i.e. SQL Server Native CDC), then you can also sink deletes with CDC in ADF.

     

    ericpasztor  We are working on enabling SQL MI and it should be ready in a few weeks.

  • Great feature. Is this supported with SQL Managed Instance or only Azure SQL Database and SQL Server?

  • versydney Synapse already has similar functionality with Synapse Link and Data Mapper. This public preview is only available in ADF as CDC.

  • versydney's avatar
    versydney
    Iron Contributor

    This is awesome. Will this be available in Synapse Pipelines too or is there an equivalent functionality?