Forum Discussion
souvitaofernandes
Oct 06, 2025Copper Contributor
EDM Classifier in Microsoft Purview: No Match Detected After Successful Indexing
Hello everyone,
I’m working on an Exact Data Match (EDM) classifier in Microsoft Purview using the new experience, and I’m facing an issue where the classifier does not return any matches during testing, even though all steps were completed successfully.
What I Did
- Created EDM Classifier:
- Primary element: CPF (ndodocumentodeidentificacao), with ignored delimiters (.,-).
- Text fields configured as case-insensitive.
- Phone field configured to ignore spaces and hyphens.
- Detection Rules:
- High Confidence: CPF + Name + Email + Phone.
- Medium Confidence: CPF + Name OR Email.
- Proximity: default (300 characters), Wide Mode.
- Downloaded schema, ran ValidateData → Passed.
- Performed hash and upload using EdmUploadAgent.exe → Completed successfully.
- Status in Purview: Index Complete.
The Problem
When I test using EDM Classifier Test with a .docx file containing exactly one row from the original CSV (CPF, Name, Email, Phone, etc.), the results are:
- Base SIT (Brazil CPF) detects the CPF.
- EDM SIT → No match detected.
What I’ve Verified
- Document saved as a real .docx (not just renamed).
- Data matches the CSV exactly (same formatting, same delimiters).
- Indexing status shows Complete.
Questions
- Has anyone experienced this issue?
- Could this be related to tokenization, proximity, or confidence rules?
- Are there any additional configurations or limitations I should be aware of?
Environment
- Microsoft Purview (new EDM experience).
- EDMUploadAgent installed on Windows Server.
- CSV file: UTF-8, comma-delimited, header matches schema.
Any insights or troubleshooting tips would be greatly appreciated!
No RepliesBe the first to reply