Feb 15 2021 06:50 PM
How does the data pattern and column pattern and the distinct and match criteria apply to xls-files stored in blob storage. It will be helpful to have an example of how to regex the filename for classification and an example for file content.
Feb 23 2021 05:34 PM
Hi @ChandruS , do you have any further information on how classification rules work with xls-files?
Feb 23 2021 09:26 PM
Hi @ChandruS , is there any further documentation on how this works?
Feb 24 2021 09:54 AM
Hi @DebbieH,
Classification rules are applied based on the RegEx pattern using data pattern and/or column pattern.
Distinct match threshold is the total number of "distinct data values" that need to be found in a column before the scanner runs the data pattern on it. Minimum match threshold is the minimum percentage of data value matches in a column that must be found by the scanner for the classification to be applied.
Relevant documentation: https://docs.microsoft.com/en-us/azure/purview/supported-classifications
Feb 24 2021 10:50 PM
Hi @AniMukherjee ,
Thank you. We are trying to understand what is classified as data and as a column in an xls file?
Feb 25 2021 07:36 AM
Hi @DebbieH,
Currently, we don't provide details on what's classified as data and as a column in a file. We will keep this requirement in mind as part of future improvement.