
DebbieH
Copper Contributor
Feb 16, 2021

How classification rules work with xls-files stored in blob storage

How do the data pattern and column pattern, and the distinct match and minimum match criteria, apply to xls files stored in blob storage? It would be helpful to have an example of a regex that classifies on the filename and an example that classifies on file content.

5 Replies

    • AniMukherjee
      Former Employee

      Hi DebbieH,

       

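      Classification rules are applied by regular expression (RegEx) matching, using a data pattern, a column pattern, or both. The data pattern is matched against the values stored in a column, and the column pattern is matched against the column name.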
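To make the difference between the two patterns concrete, here is a minimal Python sketch. The employee-ID format and both regular expressions are hypothetical and chosen only for illustration. The sketch uses Python's `re` module, and it does not claim to reproduce the scanner's own regex dialect or matching behavior.

```python
import re

# Hypothetical patterns, for illustration only (not taken from any real rule).
# Column pattern: meant to match a column *name* such as "Employee_ID" or "emp id".
COLUMN_PATTERN = re.compile(r"^emp(loyee)?[_ ]?id$", re.IGNORECASE)
# Data pattern: meant to match a column *value* such as "EMP123456".
DATA_PATTERN = re.compile(r"^EMP\d{6}$")


def column_name_matches(name: str) -> bool:
    """Return True if the column name matches the hypothetical column pattern."""
    return COLUMN_PATTERN.search(name) is not None


def value_matches(value: str) -> bool:
    """Return True if a single cell value matches the hypothetical data pattern."""
    return DATA_PATTERN.search(value) is not None
```

For instance, `column_name_matches("Employee_ID")` and `value_matches("EMP123456")` both hold under these patterns, while `value_matches("EMP12")` does not.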

       

      Distinct match threshold is the total number of "distinct data values" that need to be found in a column before the scanner runs the data pattern on it. Minimum match threshold is the minimum percentage of data value matches in a column that must be found by the scanner for the classification to be applied.
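The two thresholds can be sketched in Python as follows. This only illustrates the description above and is not the scanner's actual implementation. The function name, the default values of 8 and 60, the use of full-string matching, and computing the percentage over distinct values are all assumptions made for the example.

```python
import re
from typing import Iterable


def classification_applies(
    values: Iterable[str],
    data_pattern: str,
    distinct_threshold: int = 8,   # assumed default, adjust to your rule
    min_match_pct: float = 60.0,   # assumed default, adjust to your rule
) -> bool:
    """Sketch of the two-step check: distinct-value gate, then match percentage."""
    # Step 1: collect distinct, non-empty values from the column.
    distinct = {v for v in values if v}
    # The data pattern is only evaluated once enough distinct values exist.
    if len(distinct) < distinct_threshold:
        return False
    # Step 2: percentage of distinct values that match the data pattern.
    # Assumption: full-string match, percentage over distinct values.
    regex = re.compile(data_pattern)
    matched = sum(1 for v in distinct if regex.fullmatch(v))
    return 100.0 * matched / len(distinct) >= min_match_pct
```

Under these assumptions, a column with only five distinct values is never classified, whatever its contents, and a column with ten distinct values needs at least six of them to match the pattern.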

       

      Relevant documentation: https://docs.microsoft.com/en-us/azure/purview/supported-classifications

      • DebbieH
        Copper Contributor

        Hi AniMukherjee,

        Thank you. We are trying to understand what counts as data and what counts as a column in an xls file.

  • DebbieH
    Copper Contributor

    Hi ChandruS, do you have any further information on how classification rules work with xls files?
