Forum Discussion
Retrieving the schema of scanned asset files in a Synapse PySpark notebook
Hi there, we have a Purview account, Synapse with PySpark, and a storage account. We scan files stored in the storage account with Purview, and we can see the schema for those files in Purview. Now we are trying to retrieve the column schema for those files in a Synapse PySpark notebook. We are doing this because we want to dynamically create tables in a SQL database using the column schemas of these files, and for that we first need to retrieve the schema. Can anybody tell us how to do that? Thanks in advance.
Also, we are referring to this article: https://techcommunity.microsoft.com/t5/azure-architecture-blog/exploring-purview-s-rest-api-with-python/ba-p/2208058
With it we could get the asset list with file names, GUIDs, etc. How can we then get the schema for those files using the GUID, if that is possible at all?
Also, in the article the code connects to the storage account and reads the file. We are not sure why it needs to connect to the storage account when we are trying to get the schema from Purview.