Forum Discussion

armelkamgangfotso
Copper Contributor
Jul 07, 2025

Using Microsoft Purview to Identify and Label Sensitive Data Exposed to Generative AI Tools

Hello everyone,
I'm currently working on a data governance initiative and would like to use Microsoft Purview to automatically identify and label sensitive data that could be exposed through generative AI (GenAI) tools such as Microsoft Copilot, the Azure OpenAI Service, or other integrated conversational agents.

My main goals are to:

Detect and label sensitive data that may be surfaced or referenced in prompts or AI-generated outputs (the sketch after this list shows the kind of detection logic I have in mind)
Apply sensitivity labels and data loss prevention (DLP) policies to restrict inappropriate data exposure
Integrate this detection into a broader DLP strategy
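
To make the detection goal concrete, here is a small local Python sketch of the kind of logic I mean. It only mirrors the structure of a custom Sensitive Information Type (primary regex pattern, supporting keywords, proximity window, confidence level); it is not an actual Purview SIT definition, and the "EMP-" employee ID format and keywords are placeholders I made up for illustration:

# Local, illustrative sketch only -- NOT an actual Purview SIT definition.
# It mimics how a custom Sensitive Information Type is structured:
# a primary regex pattern, supporting keywords, a character proximity window,
# and a confidence level based on how much corroborating evidence is nearby.
import re
from dataclasses import dataclass

@dataclass
class SitMatch:
    value: str
    confidence: str  # "high" or "medium", like SIT confidence levels

# Primary pattern: a hypothetical "employee ID" format (EMP- followed by 6 digits).
PRIMARY = re.compile(r"\bEMP-\d{6}\b")
SUPPORTING_KEYWORDS = ("employee id", "badge", "hr record")
PROXIMITY = 300  # characters, similar to a SIT's character proximity setting

def scan_prompt(text: str) -> list[SitMatch]:
    # Scan a prompt (or AI output) for the primary pattern and raise the
    # confidence when a supporting keyword appears within the proximity window.
    matches = []
    lowered = text.lower()
    for m in PRIMARY.finditer(text):
        window = lowered[max(0, m.start() - PROXIMITY): m.end() + PROXIMITY]
        has_keyword = any(k in window for k in SUPPORTING_KEYWORDS)
        matches.append(SitMatch(m.group(), "high" if has_keyword else "medium"))
    return matches

if __name__ == "__main__":
    sample = "Can you summarise the HR record for employee id EMP-123456?"
    for hit in scan_prompt(sample):
        print(hit)

In other words, I want Purview itself to apply this pattern-plus-keyword-plus-confidence logic to GenAI prompts and responses, rather than us running anything like the above ourselves.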

Here are my questions:

What are the recommended steps to configure Microsoft Purview for monitoring and labeling sensitive data in a GenAI environment?
Is there a way to audit or trace sensitive data usage within interactions involving Copilot or other AI tools? (I've added a sketch after these questions showing how I currently imagine pulling such audit records.)
Do you have any best practices or examples of configuring Sensitive Information Types (SITs) or DLP policies tailored for GenAI scenarios?
Does Microsoft offer native integration between Purview and AI activity, or would we need custom connectors/logs to monitor data exposure?
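
For the audit question, here is roughly how I imagine pulling those records today via the Office 365 Management Activity API if there is no native path in Purview. The tenant and app values are placeholders, it assumes an app registration with the ActivityFeed.Read application permission and an existing Audit.General subscription, and my guess that Copilot events land in Audit.General with "Copilot" in the Operation field is exactly the kind of thing I'd like confirmed or corrected:

# Rough sketch: pull recent Audit.General content blobs from the Office 365
# Management Activity API and keep records whose Operation mentions Copilot.
# Assumes an app registration with ActivityFeed.Read and that the Audit.General
# subscription has already been started; pagination is ignored for brevity.
import datetime
import requests

TENANT_ID = "<tenant-id>"          # placeholder
CLIENT_ID = "<app-client-id>"      # placeholder
CLIENT_SECRET = "<app-secret>"     # placeholder

def get_token() -> str:
    # Client-credentials flow against the v1 token endpoint for manage.office.com
    resp = requests.post(
        f"https://login.microsoftonline.com/{TENANT_ID}/oauth2/token",
        data={
            "grant_type": "client_credentials",
            "client_id": CLIENT_ID,
            "client_secret": CLIENT_SECRET,
            "resource": "https://manage.office.com",
        },
    )
    resp.raise_for_status()
    return resp.json()["access_token"]

def list_copilot_events(hours_back: int = 24) -> list[dict]:
    token = get_token()
    headers = {"Authorization": f"Bearer {token}"}
    end = datetime.datetime.utcnow()
    start = end - datetime.timedelta(hours=hours_back)
    base = f"https://manage.office.com/api/v1.0/{TENANT_ID}/activity/feed"
    # List the available content blobs for the Audit.General feed in the window.
    blobs = requests.get(
        f"{base}/subscriptions/content",
        params={
            "contentType": "Audit.General",
            "startTime": start.strftime("%Y-%m-%dT%H:%M:%S"),
            "endTime": end.strftime("%Y-%m-%dT%H:%M:%S"),
        },
        headers=headers,
    )
    blobs.raise_for_status()
    events = []
    for blob in blobs.json():
        # Each blob's contentUri returns a JSON array of audit records.
        records = requests.get(blob["contentUri"], headers=headers).json()
        events.extend(r for r in records if "Copilot" in r.get("Operation", ""))
    return events

if __name__ == "__main__":
    for event in list_copilot_events():
        print(event.get("CreationTime"), event.get("UserId"), event.get("Operation"))

If Purview already surfaces this through the built-in audit search or DSPM for AI reporting, I'd much rather rely on that than maintain a custom pull like this.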

Any experience, guidance, or references would be greatly appreciated. Thanks in advance for your support!
