Azure Data Factory Blog

Read and Write Complex Data Types in ADF

Mark Kromer
Microsoft
Oct 12, 2020

ADF has connectors for the Parquet, Avro, and ORC data lake file formats. However, datasets used by the Copy activity do not currently support the complex data types (structs, arrays, and maps) those formats can contain. Here is how to read and write those complex columns in ADF by using data flows.
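
For context, "complex" here means nested types such as structs, arrays, and maps. As a running example, suppose the Parquet file holds records shaped like this (hypothetical data, shown as JSON for readability):

```json
{
  "orderId": 1001,
  "customer": { "name": "Contoso", "city": "Seattle" },
  "items": [
    { "sku": "A-100", "qty": 2 },
    { "sku": "B-200", "qty": 1 }
  ]
}
```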

There is a description of this technique on each file format's documentation page in the ADF online docs:

https://docs.microsoft.com/en-us/azure/data-factory/format-orc#dataset-properties

https://docs.microsoft.com/en-us/azure/data-factory/format-parquet#data-type-support

https://docs.microsoft.com/en-us/azure/data-factory/format-avro#data-flows 

Step 1: Create a new dataset and choose the file format type. In this example, I am using Parquet. When prompted for Import schema, choose None.
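
If you author the dataset as JSON rather than through the UI, an empty schema array accomplishes the same thing. A minimal sketch, assuming an ADLS Gen2 linked service; the dataset name, linked service name, and file path are all placeholders:

```json
{
    "name": "ComplexParquetDataset",
    "properties": {
        "type": "Parquet",
        "linkedServiceName": {
            "referenceName": "AzureDataLakeStorageLS",
            "type": "LinkedServiceReference"
        },
        "typeProperties": {
            "location": {
                "type": "AzureBlobFSLocation",
                "fileSystem": "data",
                "fileName": "orders.parquet"
            }
        },
        "schema": []
    }
}
```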

Step 2: Make a data flow with this new dataset as the source.
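
In JSON terms, the data flow only needs a source transformation that references the dataset. A rough sketch (names are placeholders; schema drift is left on so columns flow through before you import the projection):

```json
{
    "name": "ComplexTypesDataFlow",
    "properties": {
        "type": "MappingDataFlow",
        "typeProperties": {
            "sources": [
                {
                    "name": "ParquetSource",
                    "dataset": {
                        "referenceName": "ComplexParquetDataset",
                        "type": "DatasetReference"
                    }
                }
            ],
            "sinks": [],
            "transformations": [],
            "scriptLines": [
                "source(allowSchemaDrift: true,",
                "  validateSchema: false) ~> ParquetSource"
            ]
        }
    }
}
```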

Step 3: Go to Projection -> Import Projection.
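
Importing the projection writes the discovered structure into the data flow script, where structs appear as parenthesized sub-schemas and arrays carry a [] suffix. For the hypothetical orders file above, the source's scriptLines would end up looking roughly like this:

```json
"scriptLines": [
    "source(output(",
    "    orderId as integer,",
    "    customer as (name as string, city as string),",
    "    items as (sku as string, qty as integer)[]",
    "  ),",
    "  allowSchemaDrift: true,",
    "  validateSchema: false) ~> ParquetSource"
]
```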

Step 4: You’ll see your data, complex columns included, under Data Preview.


1 Comment

  • preetijaiswal
    Copper Contributor

    Hi Mark Kromer,

    Your articles are amazing and really helpful!

    I have been trying to dynamically upload multiple CSV files from on-prem to a SQL Server hosted in AWS. Please advise me if you have a way to do that.

    I did this by using a self-hosted IR for the file server and a Get Metadata > ForEach > Copy activity pattern. All the CSV files migrated to SQL Server successfully, but with one problem: while mapping, every CSV column was treated as a string, so all the table columns in SQL Server came out as nvarchar(max). That differs from what I get when I manually import the CSVs with the SSIS import option, where the columns have proper data types like date, int, bigint, etc.

    For my requirement, I need to make this task as dynamic as possible, with correct data types, using ADF only. Would you please suggest another way, or correct me where I am wrong? Someone also suggested invoking a Python file from ADF with a script that gets the table metadata from the CSV, passes it to the pipeline, and then creates the table. Due to cost issues I can't use Databricks.

    I don't know how to do this as I am a newbie. Please help me out by throwing some light on it.

    Thanks