Forum Discussion

h_a's avatar
h_a
Copper Contributor
Nov 01, 2025

Detecting Duplicate Documents

I am looking for an approach to identify duplicate documents within and across file servers of an organisation.

What functionalities would be used for this and preferably if someone can provide a practical, step by step approach it will help. Am relatively new to Purview.

Understand this should be probably possible using Information protection, but not clear exactly how. Thanks for help.

1 Reply

  • Dean_Gross's avatar
    Dean_Gross
    Silver Contributor

    The Purview Information Protection scanner can be used for the on-premises file shares, this will identify files that have sensitive information, but won't help with duplicate files. Purview does not include any specific functionality to help with finding duplicate files, PowerShell and 3rd party products will be needed. 

Resources