Forum Discussion

Riaki22
Copper Contributor
Mar 18, 2025
Solved

Future Support for Configurable Sampling in Purview Classification

We have a question regarding the sampling method used in Microsoft Purview for classification.

Based on the documentation, we understand that for tabular data sources (e.g., SQL databases), Purview samples only the top 128 rows for classification.

However, our client has tables with millions of rows, and a sample of only the first 128 rows may not be representative of the data as a whole. Are there any plans to let users configure the number of sampled rows in a future update? This would greatly improve classification accuracy for large datasets.

Thanks in advance for your insights!

1 Reply