Microsoft Entra Suite Tech Accelerator
Aug 14 2024, 07:00 AM - 09:30 AM (PDT)
Microsoft Tech Community
SOLVED

How long for full "Discovery" scan to complete after creation of a new Sensitive Information Type?

Copper Contributor

Our organization is trying to use keyword searches on individual names to do CCPA discovery searches. This seems like a good option to discovery instances of a person's name in Exchange, OneDrive, etc. but the question becomes---how long until the full scan completes after initial creation of the SIT? We let it sit for 2 or 3 days and it came back with 1 OneDrive hit. After 2 more days, it came up with 3 additional OneDrive hits (and these were not NEWLY created documents).

 

Just need to know so we can advise on our SLA for doing CCPA discovery to our director.

 

Thanks!

 

Luke Fisher

2 Replies
best response confirmed by Luke_Michael_Fisher (Copper Contributor)
Solution

@Luke_Michael_Fisher 

 

This probably isn't the information you were looking for, however, in the past when I have raised this question with Microsoft, they have not been able to give a definitive answer here. It doesn't appear there is an exact calculation on how long the scans should take outside of the more data you have, the longer it takes.

 

In that same breath, I have not been given or able to find the pattern in which the crawlers (Microsoft's term for what scans the data) process the data, which is why you are seeing older data appear before new data.

 

I understand you're working to give better timeline expectations to your director, similar to how I had to give this answer to my clients, and I know this is not ideal information. If I find that the answer to this has changed, I will certainly follow up on here if someone else hasn't already corrected me.

Thanks Mike. Not ideal but it is good to know nonetheless! Appreciate the thoughtful response.
1 best response

Accepted Solutions
best response confirmed by Luke_Michael_Fisher (Copper Contributor)
Solution

@Luke_Michael_Fisher 

 

This probably isn't the information you were looking for, however, in the past when I have raised this question with Microsoft, they have not been able to give a definitive answer here. It doesn't appear there is an exact calculation on how long the scans should take outside of the more data you have, the longer it takes.

 

In that same breath, I have not been given or able to find the pattern in which the crawlers (Microsoft's term for what scans the data) process the data, which is why you are seeing older data appear before new data.

 

I understand you're working to give better timeline expectations to your director, similar to how I had to give this answer to my clients, and I know this is not ideal information. If I find that the answer to this has changed, I will certainly follow up on here if someone else hasn't already corrected me.

View solution in original post