Parquet format support added to Wrangling Data Flow in Azure Data Factory
Published Feb 05 2020 01:25 PM 5,576 Views
Microsoft

Wrangling Data Flow (WDF) in ADF now supports Parquet format. You can have your data stored in ADLS Gen2 or Azure Blob in parquet format and use that to do agile data preparation using Wrangling Data Flow in ADF

 

Create a parquet format dataset in ADF and use that as an input in your wrangling data flow

 

2020-02-05_13h17_42.png

 

You can then use the parquet format dataset as an input to your Wrangling Data Flow to do agile data preparation at cloud scale via spark execution

 

2020-02-05_13h20_00.png

 

 

2020-02-05_13h22_50.png

 

Learn more about using Wrangling Data Flow to do data preparation at cloud scale here.

 

2 Comments
Copper Contributor

@Gaurav Malhotra, it is exciting to see Parquet in Power Query. Do you know if this same capability is coming to Power Query in the Power Platform, such as Power BI dataflows? Or is this feature dependent on ADF-specific parquet interpretation tech?

Copper Contributor

Is managed identify auth planned for the near term? My understanding is that if your ADLS instance is in a VNet, service principal auth can't be used with an Azure Auto resolve IR, so until MI auth is supported this feature can't be used.

Version history
Last update:
‎Feb 05 2020 01:25 PM
Updated by: