How classification rules work with xls-files stored in blob storage


How do the data pattern, the column pattern, and the distinct match and minimum match thresholds apply to xls files stored in blob storage? It would be helpful to have an example of how to regex the filename for classification and an example for file content.

5 Replies

Hi @ChandruS , do you have any further information on how classification rules work with xls-files? 

Hi @ChandruS , is there any further documentation on how this works? 

Hi @DebbieH,

 

Classification rules are applied using RegEx patterns: a data pattern, which is matched against the values in a column, and/or a column pattern, which is matched against the column name.
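
To make the two pattern types concrete, here is a minimal sketch in Python. The "employee ID" rule, both regexes, and the function names are hypothetical and made up for illustration. They are not taken from Purview, and the scanner's regex flavor and matching behavior may differ from Python's `re` module.

```python
import re

# Hypothetical column pattern: matches header names such as "emp_id" or "Employee ID".
COLUMN_PATTERN = re.compile(r"^(emp|employee)[_ ]?id$", re.IGNORECASE)

# Hypothetical data pattern: matches values such as "EMP004217".
DATA_PATTERN = re.compile(r"^EMP\d{6}$")


def column_name_matches(name: str) -> bool:
    """Return True if a column header matches the column pattern."""
    return COLUMN_PATTERN.fullmatch(name.strip()) is not None


def value_matches(value: str) -> bool:
    """Return True if a single cell value matches the data pattern."""
    return DATA_PATTERN.fullmatch(value.strip()) is not None
```

For example, `column_name_matches("Employee_ID")` and `value_matches("EMP004217")` both return `True`, while `column_name_matches("department")` and `value_matches("EMP42")` return `False`.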

 

Distinct match threshold is the total number of "distinct data values" that need to be found in a column before the scanner runs the data pattern on it. Minimum match threshold is the minimum percentage of data value matches in a column that must be found by the scanner for the classification to be applied.
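
As a rough illustration of how the two thresholds interact, here is a small Python sketch. It is a simulation under stated assumptions, not the scanner's actual implementation: the function name `should_classify` is hypothetical, the default values of 8 and 60 are placeholders, and computing the percentage over distinct values rather than all values is an assumption.

```python
import re


def should_classify(values, data_pattern, distinct_threshold=8, min_match_pct=60.0):
    """Simulate the two thresholds for one column of string values.

    Assumptions (not confirmed by the thread): empty cells are ignored, and
    the match percentage is computed over distinct values.
    """
    # Collect the distinct, non-empty values in the column.
    distinct = {v.strip() for v in values if v and v.strip()}

    # Distinct match threshold: too few distinct values, so the data
    # pattern is not run at all.
    if len(distinct) < distinct_threshold:
        return False

    # Minimum match threshold: percentage of distinct values that match.
    regex = re.compile(data_pattern)
    matches = sum(1 for v in distinct if regex.fullmatch(v))
    return 100.0 * matches / len(distinct) >= min_match_pct
```

With a hypothetical pattern `r"EMP\d{6}"`, a column holding ten distinct matching IDs would be classified, while a column with only five distinct values would be skipped before the pattern is evaluated.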

 

Relevant documentation: https://docs.microsoft.com/en-us/azure/purview/supported-classifications

Hi @AniMukherjee ,

 

Thank you. We are trying to understand what is classified as data and what is classified as a column in an xls file.

Hi @DebbieH,

 

Currently, we don't provide details on what's classified as data and what's classified as a column in a file. We will keep this requirement in mind for future improvements.