Find sensitive data types in Outlook mailboxes

Question

Hi all,&nbsp;I want to help my users reduce the size of their large Outlook inboxes, for efficiency and security purposes. I want them to start with the sensitive data, such as Swift codes, credit card numbers etc. I want to use Purview to highlight who has what, and where exactly to find it in their mailbox, so that these emails can be filed in the right file storage system we have, rather than sitting in their mailbox.&nbsp;&nbsp;I want an accurate report to show that user 1 has x amount of that data, including the sensitive data type to help with prioritising their efforts, the email Subject to identify and locate the relevant email for filing etc.&nbsp;I can see in the Purview Content explorer that there are x amount of credit card data, of which A of that total are in Exchange, B is in Sharepoint etc.&nbsp;If I expand the Exchange section, I can see which mailbox has what, as a count summary. If I click on each individual mailbox, I can see the sum total of the different data types that Purview has detected. However, in the portal I can see that some of the results are low and medium confidence.&nbsp;&nbsp;If I export the data as .xlsx for that one mailbox as a test, I see a bunch of script in the formula bar that is difficult to read, let alone separate out easily to find only the high confidence stuff.&nbsp;&nbsp;Am I going about this the right way?&nbsp;If so, how can I make this as quick and painless as possible for myself, to be able to say who has what and where, or at least verify what Purview is telling me?&nbsp;&nbsp;If not, what should/can I do&nbsp;to help our users cull their risk?

ismkay · Answer

Hi&nbsp;GI472,&nbsp;&nbsp;Your use case is in other words about data discovery.Content Explorer is good place to look for such content but you have to parse the data to be able to read it correctly.https://learn.microsoft.com/en-us/microsoft-365/compliance/data-classification-content-explorer?view=o365-worldwideYou can try to extract the data and visualize it using Power BI for example similar to what was done in this reference&nbsp;Microsoft 365 Compliance audit log activities via O365 Management API - Part 2 - Microsoft Community Hub&nbsp;As for confidence level indicated in the content explorer it is based on confidence level of the SITs and this can be changed&nbsp;https://learn.microsoft.com/en-us/microsoft-365/compliance/data-classification-increase-accuracy?view=o365-worldwide&nbsp;If you need to look from now on, I would suggest to add some policies to inspect content using one of multiple options available. Such as&nbsp;a file policy within Microsoft Defender for Cloud Apps with filter App=ExchangeOnline to discover specific SITs or Trainable Classifiers using content inspection.https://learn.microsoft.com/en-us/defender-cloud-apps/data-protection-policieshttps://learn.microsoft.com/en-us/defender-cloud-apps/dcs-inspection&nbsp;Here you have to be careful since you are looking at real user data and you may have legal obligations for your actions.&nbsp;Thank you and kind regards&nbsp;

gi472 · Answer

Thanks, IsmKay, I'll take a look at those links!

Forum Discussion

Find sensitive data types in Outlook mailboxes

2 Replies

Resources