Forum Discussion
h_a
Nov 01, 2025 (Copper Contributor)
Detecting Duplicate Documents
I am looking for an approach to identify duplicate documents within and across file servers of an organisation.
Which features would be used for this? A practical, step-by-step approach would help, as I am relatively new to Purview.
I understand this should probably be possible using Information Protection, but it is not clear exactly how. Thanks for the help.
1 Reply
- Dean_Gross (Silver Contributor)
The Purview Information Protection scanner can be used for on-premises file shares. It will identify files that contain sensitive information, but it won't help with duplicate files. Purview does not include any specific functionality for finding duplicate files, so PowerShell scripts or third-party products will be needed.
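To give a rough idea of what a script-based approach could look like, here is a minimal sketch of content-hash duplicate detection. It is written in Python rather than PowerShell, and it is not a Purview feature. The function names `sha256_of_file` and `find_duplicates` are made up for illustration. It assumes the shares are reachable as ordinary paths from the machine running it, and it only finds byte-for-byte identical files, not near-duplicates or renamed edits.

```python
import hashlib
import os
from collections import defaultdict


def sha256_of_file(path, chunk_size=1024 * 1024):
    """Hash a file in chunks so large files are not loaded into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(roots):
    """Return a list of groups; each group is a sorted list of paths with identical content.

    `roots` is a list of folders to scan (hypothetical examples: mounted shares
    or UNC paths). Unreadable files are skipped silently in this sketch.
    """
    # Step 1: group by size, because files of different sizes cannot be identical.
    by_size = defaultdict(list)
    for root in roots:
        for folder, _dirs, files in os.walk(root):
            for name in files:
                path = os.path.join(folder, name)
                try:
                    by_size[os.path.getsize(path)].append(path)
                except OSError:
                    continue

    # Step 2: hash only the files that share a size with at least one other file.
    by_hash = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue
        for path in paths:
            try:
                by_hash[(size, sha256_of_file(path))].append(path)
            except OSError:
                continue

    return [sorted(paths) for paths in by_hash.values() if len(paths) > 1]


if __name__ == "__main__":
    import sys

    # Pass one or more folders on the command line.
    for group in find_duplicates(sys.argv[1:]):
        print("Duplicate set:")
        for path in group:
            print("  " + path)
```

Passing several roots in one run is what lets it catch duplicates across servers as well as within one. Grouping by size first keeps the number of files that actually get hashed down, which matters on large shares. The same idea can be done in PowerShell with `Get-ChildItem -Recurse` and `Get-FileHash`.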