<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>Copilot Studio Blog articles</title>
    <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/bg-p/copilot-studio-blog</link>
    <description>Copilot Studio Blog articles</description>
    <pubDate>Thu, 28 May 2026 05:37:05 GMT</pubDate>
    <dc:creator>copilot-studio-blog</dc:creator>
    <dc:date>2026-05-28T05:37:05Z</dc:date>
    <item>
      <title>Computer-using agents in Microsoft Copilot Studio are now generally available</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/computer-using-agents-in-microsoft-copilot-studio-are-now/ba-p/4519427</link>
      <description>&lt;P&gt;The next chapter of enterprise AI isn't about chatting with assistants—it's about &lt;STRONG&gt;agents that actually do the work.&lt;/STRONG&gt; Until now, automating long-tail, UI-driven business processes meant either building and maintaining brittle RPA scripts or waiting on APIs that legacy systems were never going to expose.&lt;/P&gt;
&lt;P&gt;That gap has kept some of the most valuable workflows—the ones buried in vendor portals, internal web apps, and proprietary line-of-business systems—out of reach for modern automation. For enterprise IT teams, the challenge hasn’t just been automating these workflows. It’s been doing so in a way that remains secure, governable, and scalable across the business.&lt;/P&gt;
&lt;P&gt;The gap is now closing.&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/computer-use" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Computer use in Microsoft Copilot Studio&lt;/STRONG&gt;&lt;/A&gt;&lt;STRONG&gt; is now generally available&lt;/STRONG&gt;, and we're expanding availability to&amp;nbsp;&lt;STRONG&gt;all commercial geographies in Microsoft Power Platform&lt;/STRONG&gt;.&lt;/P&gt;
&lt;H2&gt;New computer use features generally available&lt;/H2&gt;
&lt;P&gt;With this release, every Copilot Studio maker can build agents that don't just reason and respond—they take action directly inside any application a person can use. For IT teams, GA represents more than a new automation capability; it’s a shift toward a more governable and enterprise-ready model for AI-driven work. Organizations can better standardize how agents operate across applications while maintaining security, observability, and administrative control through the Power Platform admin center.&lt;/P&gt;
&lt;P&gt;With this release, computer use delivers:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Global availability across all commercial Power Platform geos&lt;/STRONG&gt;, so customers in every region can deploy computer use in agents under their tenant's data residency and compliance boundaries.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Secure authentication&lt;/STRONG&gt; with built in credentials and &lt;A href="https://learn.microsoft.com/en-us/power-apps/maker/data-platform/environmentvariables-azure-key-vault-secrets#configure-azure-key-vault" target="_blank" rel="noopener"&gt;Azure Key Vault&lt;/A&gt; when signing in to website or desktop applications.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Enterprise governance built in,&lt;/STRONG&gt;&amp;nbsp;allowing lists for websites or desktop applications and native Power Platform governance capabilities such as DLP policies, environment isolation, and audit trails.&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/human-supervision-computer-use" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Human-in-the-loop checkpoints&lt;/STRONG&gt;&amp;nbsp;&lt;/A&gt;for low-confidence steps, exceptions, and decisions that require an operator's approval.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Run history and observability&lt;/STRONG&gt;, so makers and admins can see exactly what the agent saw, what it clicked, and why. Logs are also propagated to Purview and Dataverse for audits and admin review.&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Model choice&lt;/STRONG&gt; &lt;STRONG&gt;for your agents&lt;/STRONG&gt;,&amp;nbsp;with models from OpenAI and Anthropic.&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;img&gt;
&lt;P&gt;&lt;EM&gt;Add computer use as a tool in a Copilot Studio agent&lt;/EM&gt;&lt;/P&gt;
&lt;/img&gt;
&lt;H2&gt;Reach every system, including the ones without APIs&lt;/H2&gt;
&lt;P&gt;Computer use gives an agent the same tools a person has: a browser, a screen, a keyboard, and the ability to read what's on the page and take the next logical step. Instead of brittle selector-based automation, the computer use tool uses vision and reasoning to navigate live UIs—adapting when layouts shift, fields move, or workflows branch.&lt;/P&gt;
&lt;P&gt;For organizations with deep investments in proprietary platforms or third-party portals, this changes the math on automation. Workflows that previously required either a multi-quarter integration project or an army of contractors clicking through screens can now be handed to an agent.&lt;/P&gt;
&lt;P&gt;For enterprise IT organizations, this can also reduce pressure to modernize or rebuild every legacy workflow before automation can begin. That helps teams extend the value of existing systems while still moving toward broader AI transformation goals.&lt;/P&gt;
&lt;H2&gt;Customer spotlight: Graebel automates global service order processing end to end&lt;/H2&gt;
&lt;P&gt;&lt;A href="https://www.graebel.com/" target="_blank" rel="noopener"&gt;Graebel&lt;/A&gt;, a global leader in talent mobility with approximately 1,500 employees, manages thousands of cross-border employee relocations every year for multinational clients. A significant share of those relocation requests arrives the way most enterprise work arrives: as free-form emails, full of unstructured instructions, attachments, and edge cases. Each email had to be read, interpreted, and entered by hand into Graebel's proprietary&amp;nbsp;Global Connect&amp;nbsp;platform.&lt;/P&gt;
&lt;P&gt;Global Connect couldn’t support a API-based integration, and earlier robotic process automation (RPA) attempts proved too rigid to keep up with the variability of human-written emails. Graebel needed automation that could use reasoning, not just click.&lt;/P&gt;
&lt;P&gt;Working with&amp;nbsp;GET AI&amp;nbsp;and Microsoft, Graebel built and deployed the&amp;nbsp;&lt;STRONG&gt;Graebel Service Order Agent, &lt;/STRONG&gt;equipped with computer use,&amp;nbsp;in Microsoft Copilot Studio. The agent now:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Monitors designated mailboxes and interprets unstructured service-order emails&lt;/STRONG&gt;&amp;nbsp;using &lt;A href="https://learn.microsoft.com/en-us/azure/ai-services/content-understanding/overview" target="_blank" rel="noopener"&gt;Azure Content Understanding&lt;/A&gt;, extracting key data into structured form with confidence scoring.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Validates each request&lt;/STRONG&gt;&amp;nbsp;against Graebel's business rules, service logic, and compliance requirements before any action is taken.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Operates Global Connect directly through its UI&lt;/STRONG&gt;—navigating screens, entering data, and completing transactions exactly as a trained human operator would, without APIs or platform redevelopment.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Escalates exceptions and low-confidence cases through human-in-the-loop workflows&lt;/STRONG&gt;, preserving governance and service quality.&lt;/LI&gt;
&lt;/UL&gt;
&lt;img&gt;
&lt;P&gt;&lt;EM&gt;Architecture of Graebel’s Power Automate flow and custom Service Order agent&lt;/EM&gt;&lt;/P&gt;
&lt;/img&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;EM&gt;“By adopting Microsoft Copilot Studio and AI agents, we’ve moved beyond traditional automation to a more intelligent, scalable operating model. This initiative strengthens our ability to serve clients faster and more accurately while positioning Graebel for long-term growth.” &lt;/EM&gt;- Matt Brownlee, Chief Revenue Officer, Graebel&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;The Service Order Agent is live today and processing real volume, with an architecture designed to scale across&amp;nbsp;&lt;STRONG&gt;more than 30 relocation service categories&lt;/STRONG&gt;. For Graebel, the results include a meaningful reduction in manual effort, faster service-order turnaround, more consistent data quality, and a repeatable blueprint for bringing intelligent automation to the rest of their operations.&lt;/P&gt;
&lt;P&gt;&lt;A class="lia-external-url" href="https://www.microsoft.com/en/customers/story/26190-graebel-dynamics-365-finance?msockid=1381dfa2902362b42abccb9991c86324" target="_blank" rel="noopener"&gt;&lt;SPAN data-teams="true"&gt;Read how Graebel drives growth and automation with Power Platform and Copilot Studio.&lt;/SPAN&gt;&lt;/A&gt;&lt;/P&gt;
&lt;H2&gt;How to get started&lt;/H2&gt;
&lt;P&gt;Ready to try computer‑using agents in Copilot Studio?&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Create or open an agent in&amp;nbsp;&lt;STRONG&gt;Microsoft Copilot Studio.&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI&gt;Go to&amp;nbsp;&lt;STRONG&gt;Tools → Add tool → Add new&lt;/STRONG&gt;&amp;nbsp;&lt;STRONG&gt;computer use.&lt;/STRONG&gt;&lt;/LI&gt;
&lt;LI&gt;Describe the task you want the agent to perform in natural language.&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;For deeper guidance, configuration details, and best practices, see the&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/computer-use" target="_blank" rel="noopener"&gt;computer use documentation&lt;/A&gt;.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Before you go:&lt;/STRONG&gt;&amp;nbsp;We’re actively investing in advanced governance, operations, and scale for CUAs—and customer feedback directly informs the roadmap. Tell us what you think of the latest CUA updates today:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Email feedback&lt;/STRONG&gt;&amp;nbsp;to &lt;A href="mailto:computeruse-feedback@microsoft.com" target="_blank" rel="noopener"&gt;computeruse-feedback@microsoft.com&lt;/A&gt; &amp;nbsp;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Join&amp;nbsp;&lt;/STRONG&gt;the&amp;nbsp;&lt;A href="https://techcommunity.microsoft.com/category/microsoft365copilot/discussions/copilot-studio" target="_blank" rel="noopener"&gt;Copilot Studio community&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Wed, 13 May 2026 17:28:08 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/computer-using-agents-in-microsoft-copilot-studio-are-now/ba-p/4519427</guid>
      <dc:creator>MustaphaLazrek</dc:creator>
      <dc:date>2026-05-13T17:28:08Z</dc:date>
    </item>
    <item>
      <title>4 ways to build a curated Agent Store and scale agent adoption</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/4-ways-to-build-a-curated-agent-store-and-scale-agent-adoption/ba-p/4518575</link>
      <description>&lt;H3&gt;The agent adoption gap is real&lt;/H3&gt;
&lt;P&gt;Agents are quickly becoming a core part of enterprise AI strategy. The &lt;A href="https://news.microsoft.com/annual-work-trend-index-2026" target="_blank" rel="noopener"&gt;2026 Work Trend Index&lt;/A&gt; shows 58% of employees are now producing work they couldn’t do a year ago, and AI is increasingly used for analysis, reasoning, and decision-making—not just tasks. While individual capability is accelerating, many organizations are struggling to keep up. The gap isn’t access to AI—it’s how work is designed around it.&lt;/P&gt;
&lt;P&gt;In customer conversations, we consistently hear three blockers:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;Where do we start?&lt;/STRONG&gt; The catalog of AI solutions is expanding faster than most organizations can evaluate.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;What can we trust? &lt;/STRONG&gt;Without a curated channel, every new agent introduces security, compliance, and data governance work—slowing adoption.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;How do we drive adoption?&lt;/STRONG&gt; Even a great agent delivers zero value if employees don’t discover it or embed it in daily workflows.&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3&gt;Find and use agents from the Agent Store&lt;/H3&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/microsoft-365/copilot/copilot-agent-store" target="_blank" rel="noopener"&gt;Agent Store&lt;/A&gt; is a curated hub in Microsoft 365 Copilot where users can discover, install, and use agents directly in the flow of work. It brings together agents built by Microsoft, trusted partners, and your own teams, making them accessible across Microsoft 365 surfaces including Teams, Outlook, Word, Excel, and PowerPoint. It provides:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;One central place to find agents from Microsoft, trusted partners, and your own teams.&lt;/LI&gt;
&lt;LI&gt;Personalized discovery experience that surfaces relevant agents based on each user’s work context.&lt;/LI&gt;
&lt;LI&gt;Fully vetted third-party agents designed to help you adopt with confidence.&lt;/LI&gt;
&lt;LI&gt;IT-administered control and oversight on store catalog curation and management.&lt;/LI&gt;
&lt;/UL&gt;
&lt;img&gt;Agent Store homepage, showing a variety of available agents&lt;/img&gt;
&lt;H3&gt;4 pathways to a curated store: start where you are&lt;/H3&gt;
&lt;P&gt;There isn’t a single path to populate your catalog. Most organizations use more than one approach based on their goals and business needs:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;Deploy pre-built agents &lt;/STRONG&gt;from Microsoft and verified partners. This is the fastest route to get started with a new agent, with no development effort required.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Build agents and distribute through Agent Store&lt;/STRONG&gt; using Microsoft Copilot Studio for faster time to value, or the Microsoft 365 Agents Toolkit and Agents SDK for pro-code development and deeper control.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Bring your own agents&lt;/STRONG&gt;. If you’ve already built on Microsoft Foundry, third-party AI platforms, or custom code, you can still onboard them through the Microsoft 365 Agents Toolkit and SDK so they’re discoverable in the Agent Store.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Integrate with the Microsoft Agent 365 SDK&lt;/STRONG&gt; to add enterprise capabilities such as Entra-backed identity, governed Microsoft 365 data access, observability, and cross-surface notifications without a full rebuild.&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3&gt;New Agent Store whitepaper available&lt;/H3&gt;
&lt;P&gt;To help organizations move from AI experimentation to scaled adoption, &lt;A href="https://adoption.microsoft.com/files/agents/AgentStorePracticalGuide.pdf" target="_blank" rel="noopener"&gt;the new Agent Store whitepaper&lt;/A&gt; is now available for download. If you’re responsible for agent adoption strategy, governance, and scaled deployment, the new guide walks through the “why,” the “what,” and the “how.” It includes:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;The business case for a curated, governed Agent Store.&lt;/LI&gt;
&lt;LI&gt;A walkthrough of the Agent Store user experience.&lt;/LI&gt;
&lt;LI&gt;Step-by-step IT deployment guidance for the four pathways.&lt;/LI&gt;
&lt;LI&gt;Getting-started recommendations to help you move from exploration to impact.&lt;/LI&gt;
&lt;/UL&gt;
&lt;H4&gt;Get the guide&lt;/H4&gt;
&lt;P&gt;&lt;A href="https://aka.ms/AgentStoreWhitepaper" target="_blank" rel="noopener"&gt;Download the Agent Store whitepaper&lt;/A&gt;&lt;/P&gt;
&lt;H3&gt;More resources:&lt;/H3&gt;
&lt;UL&gt;
&lt;LI&gt;Documentation: Set up Agent Store in Microsoft 365 Copilot —&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/microsoft-365/copilot/copilot-agent-store?toc=%2Fmicrosoft-365%2Fadmin%2Ftoc.json&amp;amp;bc=%2Fmicrosoft-365%2Fadmin%2Fbreadcrumb%2Ftoc.json&amp;amp;view=o365-worldwide" target="_blank" rel="noopener"&gt;Microsoft Learn&lt;/A&gt;&lt;EM&gt; &lt;/EM&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://aka.ms/PublishingAgents_PartnerGuide" target="_blank" rel="noopener"&gt;Agent Store Partner Guide&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="https://aka.ms/AgentGovernanceAndSecurity" target="_blank" rel="noopener"&gt;Administering and Governing Agents whitepaper&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Tue, 12 May 2026 17:00:00 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/4-ways-to-build-a-curated-agent-store-and-scale-agent-adoption/ba-p/4518575</guid>
      <dc:creator>AnnaCao</dc:creator>
      <dc:date>2026-05-12T17:00:00Z</dc:date>
    </item>
    <item>
      <title>NEW UPDATES: Administering and Governing Agents whitepaper v3.2</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/new-updates-administering-and-governing-agents-whitepaper-v3-2/ba-p/4517044</link>
      <description>&lt;P&gt;As AI agents move from pilot to production across organizations of every size, the question IT administrators are asking isn't just "How do we build agents?" It's "How do we govern them, at scale, without slowing the business down?"&lt;/P&gt;
&lt;P&gt;That's exactly what the &lt;A href="https://aka.ms/AgentGovernanceAndSecurity" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Administering and Governing Agents&lt;/STRONG&gt; whitepaper&lt;/A&gt; is designed to help with. Downloaded more than 65,000 times by IT practitioners worldwide, it has now been updated to &lt;STRONG&gt;Version 3.2.&lt;/STRONG&gt; Today you’ll find new and updated guidance that includes security and observability of your agents with &lt;A href="https://www.microsoft.com/en-us/microsoft-agent-365" target="_blank" rel="noopener"&gt;Microsoft Agent 365&lt;/A&gt;, as well as discoverability of agents with the &lt;A href="https://learn.microsoft.com/en-us/microsoft-365/copilot/copilot-agent-store" target="_blank" rel="noopener"&gt;Microsoft 365 Agent Store&lt;/A&gt;. Additionally, there is new guidance on zone-based governance strategies, agent sharing controls, and agent owner reassignment.&lt;/P&gt;
&lt;P&gt;As you continue to investigate, evaluate, or deploy agentic AI for your organization, keep this updated whitepaper on hand.&lt;/P&gt;
&lt;H3&gt;Why governance matters more than ever&lt;/H3&gt;
&lt;P&gt;Agents are being built by IT professionals, developers, and makers alike, using &lt;A href="https://www.microsoft.com/en-us/microsoft-365-copilot/microsoft-copilot-studio/" target="_blank" rel="noopener"&gt;Microsoft Copilot Studio&lt;/A&gt;, &lt;A href="https://learn.microsoft.com/en-us/microsoft-365/copilot/extensibility/agent-builder-build-agents" target="_blank" rel="noopener"&gt;Agent Builder&lt;/A&gt; in Microsoft 365 Copilot, and &lt;A href="https://azure.microsoft.com/en-us/products/ai-foundry/" target="_blank" rel="noopener"&gt;Microsoft Foundry&lt;/A&gt;. With that growth comes real accountability. Organizations carry responsibility for the agents they deploy—including how the agents access data, how they behave, and whether they comply with organizational and regulatory standards.&lt;/P&gt;
&lt;P&gt;Governance can't be an afterthought. It needs to be built into the foundation from the beginning. As a long-time authority on security and governance, you can trust Microsoft to keep you updated with the information you need to run a secure agent ecosystem.&lt;/P&gt;
&lt;H3&gt;What the whitepaper covers&lt;/H3&gt;
&lt;P&gt;The Administering and Governing Agents whitepaper is designed specifically for IT practitioners and IT decision-makers in SMBs and large enterprises. It provides a structured, practical framework for securing and governing agents across the full Microsoft 365 ecosystem, from initial creation through deployment, monitoring, and ongoing management.&lt;/P&gt;
&lt;H4&gt;A governance framework built on three pillars&lt;/H4&gt;
&lt;P&gt;Effective governance starts with fundamentals. The whitepaper organizes core governance principles into three pillars: Policy, Process, and People.&lt;/P&gt;
&lt;img&gt;
&lt;P&gt;Figure 1 Governance pyramid&lt;/P&gt;
&lt;/img&gt;
&lt;P&gt;No set of tools on their own can replace a clear governance strategy and disciplined processes, but the right tools make it far easier to operationalize those practices at scale.&lt;/P&gt;
&lt;H3&gt;Zones: a zonal tiered path to governance maturity&lt;/H3&gt;
&lt;P&gt;One of the most practical elements of the whitepaper is its &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/guidance/sec-gov-phase2" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;zoned governance model&lt;/STRONG&gt;&lt;/A&gt;, a structured framework that maps agent risk and technical complexity to three distinct governance levels, implemented as Environment Groups with the Copilot Studio admin controls. This model is designed to help IT teams adjust governance practices as agent risk evolves, enabling more intentional selection of controls over time while also still supporting early experimentation.&lt;/P&gt;
&lt;H4&gt;Zone 1: Personal productivity&lt;/H4&gt;
&lt;P&gt;The entry point. Agents assist with personal tasks, interact with Microsoft Graph data, and are confined to individual use. Security and sharing are tightly constrained by default, with connector access limited to approved Microsoft Enterprise plan and Copilot Studio core connections.&lt;/P&gt;
&lt;H4&gt;Zone 2: Team collaboration&lt;/H4&gt;
&lt;P&gt;Builds on Zone 1 with stricter connector controls, scoped Microsoft Entra security group access, and support for departmental agents. Editor access can be granted for collaborative development, but sharing remains governed and limited to internal channels.&lt;/P&gt;
&lt;H4&gt;Zone 3: Enterprise managed&lt;/H4&gt;
&lt;P&gt;The advanced tier for mission-critical or organization-wide agents. Managed by central IT or a Center of Excellence (CoE), with formal ALM pipelines, phased rollout controls, Sentinel integration, and the most rigorous security and compliance requirements.&lt;/P&gt;
&lt;img&gt;
&lt;P&gt;Figure 2 Zonal Governance&lt;/P&gt;
&lt;/img&gt;
&lt;P&gt;Taken together, the zoned model gives organizations a practical way to govern agents as they grow. This methodology provides a clear path to go from individual experimentation to enterprise‑managed deployments without constantly redefining governance expectations. The whitepaper will show you how these zones map to real controls IT teams can apply over time.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://aka.ms/AgentGovernanceAndSecurity" target="_blank" rel="noopener"&gt;Read the whitepaper&lt;/A&gt;&lt;/P&gt;
&lt;H3&gt;Governance pillars in depth&lt;/H3&gt;
&lt;P&gt;The whitepaper walks through three governance pillars applied consistently across each zone: Security controls, Management controls, and Agent Reporting.&lt;/P&gt;
&lt;P&gt;Security controls span the Copilot Studio data policies, Microsoft 365 admin center, SharePoint Online permissions, Microsoft Purview (DLP, Insider Risk Management, Communication Compliance, eDiscovery, Audit), and Microsoft Sentinel for enterprise-grade threat detection.&lt;/P&gt;
&lt;P&gt;Management controls cover environment provisioning, Managed Environments, Application Lifecycle Management (ALM), pipelines, connector management, and agent publishing controls.&lt;/P&gt;
&lt;P&gt;Agent reporting is delivered through four experiences: Inventory, Monitoring, Security, and the Copilot hub, alongside Microsoft Purview for cross-tenant data security and compliance reporting.&lt;/P&gt;
&lt;P&gt;Each of these pillars is explored in detail throughout the whitepaper, covering how to configure controls, interpret reporting signals, and apply governance continuously as your agent deployments grow in scale and complexity.&lt;/P&gt;
&lt;H3&gt;The Agent Store as a governed distribution point&lt;/H3&gt;
&lt;P&gt;The whitepaper also highlights the Microsoft 365 Copilot &lt;A href="https://learn.microsoft.com/en-us/microsoft-365/copilot/copilot-agent-store" target="_blank" rel="noopener"&gt;Agent Store&lt;/A&gt; as a discovery and access point for agents. It covers distribution paths for Microsoft-built, partner-built, custom Copilot Studio, Teams Toolkit, and Agent 365 SDK-enabled agents, and how IT administrators maintain control over what is available and to whom. This is an area we've heard about consistently from admins, partners, and the field. Who gets access to which agents, and how IT stays in control of that, matters.&lt;/P&gt;
&lt;img&gt;
&lt;P&gt;Figure 3 Agent Discoverability via Agent Store&lt;/P&gt;
&lt;/img&gt;
&lt;H3&gt;Summary: What's new in Version 3.2&lt;/H3&gt;
&lt;P&gt;This update brings three focused changes.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Agent Store as a governed entry point: New coverage of the Microsoft 365 Copilot Agent Store as a discovery and access point for agents, including distribution paths for all five agent categories and how IT administrators maintain control at every stage.&lt;/LI&gt;
&lt;LI&gt;A365 security and observability: Guidance has been introduced, covering onboarding and critical steps for maintaining visibility and security for agent activity across the tenant.&lt;/LI&gt;
&lt;LI&gt;Terminology and graphics updates to comply with name changes and enhancing visuals.&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3&gt;Get started&lt;/H3&gt;
&lt;P&gt;If you are just starting your agent governance journey or looking to put a more robust framework in place, the updated Administering and Governing Agents whitepaper (Version 3.2) is a practical, comprehensive resource for IT teams managing agents in Microsoft 365.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://aka.ms/AgentGovernanceAndSecurity" target="_blank" rel="noopener"&gt;Download the whitepaper&lt;/A&gt; to start building governance that scales with your agents and your business.&lt;/P&gt;</description>
      <pubDate>Mon, 11 May 2026 19:50:47 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/new-updates-administering-and-governing-agents-whitepaper-v3-2/ba-p/4517044</guid>
      <dc:creator>Joe_Unwin</dc:creator>
      <dc:date>2026-05-11T19:50:47Z</dc:date>
    </item>
    <item>
      <title>Read-only analytics access and custom metrics now available in Microsoft Copilot Studio</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/read-only-analytics-access-and-custom-metrics-now-available-in/ba-p/4516824</link>
      <description>&lt;H3&gt;Turning Copilot Studio analytics into clear business insight&lt;/H3&gt;
&lt;P&gt;As organizations &lt;A href="https://www.microsoft.com/en-us/microsoft-copilot/blog/copilot-studio/6-core-capabilities-to-scale-agent-adoption-in-2026/" target="_blank" rel="noopener"&gt;scale their use of AI agents&lt;/A&gt;, IT teams face a familiar dilemma: How do you give stakeholders the &lt;EM&gt;insight&lt;/EM&gt; they need—without compromising proper &lt;EM&gt;oversight&lt;/EM&gt;?&lt;/P&gt;
&lt;P&gt;Expanding access to analytics can mean granting permissions beyond what’s appropriate for a given role. Meanwhile, locking things down preserves control…but creates bottlenecks to information. This forces teams to rely on a small group of people to export and share data across multiple tools just to answer basic questions on analytics—let alone dig into data that truly demonstrates how effective your agents are.&lt;/P&gt;
&lt;P&gt;So how can your team get the &lt;STRONG&gt;most relevant, important agent metrics&lt;/STRONG&gt; to the &lt;STRONG&gt;right people&lt;/STRONG&gt; with the &lt;STRONG&gt;right level of access&lt;/STRONG&gt;? &lt;A href="https://aka.ms/CopilotStudio" target="_blank" rel="noopener"&gt;Microsoft Copilot Studio&lt;/A&gt; now offers two new analytics capabilities that address this need directly:&lt;/P&gt;
&lt;P&gt;-&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;A href="https://learn.microsoft.com/microsoft-copilot-studio/admin-share-bots?tabs=web#share-an-agents-analytics" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Analytics Viewer&lt;/STRONG&gt;&lt;/A&gt;&lt;A class="lia-external-url" href="https://learn.microsoft.com/microsoft-copilot-studio/admin-share-bots?tabs=web#share-an-agents-analytics" target="_blank" rel="noopener"&gt;&lt;STRONG&gt; role&lt;/STRONG&gt;&lt;/A&gt;, which allows an agent owner to grant read-only access to an agent’s Analytics page.&lt;/P&gt;
&lt;P&gt;-&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-custom-metrics" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Custom metrics&lt;/STRONG&gt;&lt;/A&gt;, which gives teams the flexibility to add and track specific outcome-based measures aligned to their particular goals for an agent.&lt;/P&gt;
&lt;H3&gt;The Analytics Viewer role in Copilot Studio&lt;/H3&gt;
&lt;P&gt;The &lt;A href="https://learn.microsoft.com/microsoft-copilot-studio/admin-share-bots?tabs=web#share-an-agents-analytics" target="_blank" rel="noopener"&gt;Analytics Viewer role&lt;/A&gt; provides view-only access to an agent’s Analytics page, without permission to edit, configure, publish, or share the agent.&lt;/P&gt;
&lt;P&gt;Now generally available, it enables analysts and stakeholders to monitor agent performance securely without affecting agent behavior.&lt;/P&gt;
&lt;P&gt;. When you share an agent with Analytics Viewer permissions:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Users get access only to the Analytics page.&lt;/LI&gt;
&lt;LI&gt;All other agent pages, actions, settings, test panels, and publishing experiences are hidden and unavailable.&lt;/LI&gt;
&lt;LI&gt;Analytics Viewers currently can’t define or update custom metrics and savings.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Tip:&lt;BR /&gt;&lt;EM&gt;For deeper investigation and root-cause analysis, consider granting analysts the&lt;STRONG&gt; Dataverse Bot Transcript Viewer&lt;/STRONG&gt; role. This allows Analytics Viewers to drill into the transcripts for detailed analysis.&lt;/EM&gt;&lt;/P&gt;
&lt;P&gt;See the new &lt;STRONG&gt;Analytics Viewer&lt;/STRONG&gt; option in the agent sharing experience:&lt;/P&gt;
&lt;img /&gt;
&lt;H3&gt;Who should use the Analytics Viewer role in Copilot Studio?&lt;/H3&gt;
&lt;P&gt;Analytics Viewer is designed for &lt;STRONG&gt;users who need visibility without authoring access&lt;/STRONG&gt;. This includes analysts, product owners, business stakeholders, and operations teams that monitor agent performance, trends, and issues.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;This feature supports clear separation of duties: IT pros and admins manage agents, while analysts consume insights and investigate performance.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Analytics page for Analytics Viewers:&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3&gt;Why the Analytics Viewer role matters&lt;/H3&gt;
&lt;P&gt;For many organizations, access to analytics has been a major blocker for adopting agents in production. It’s understandable. Teams often need production insights, but only a small set of users should be able to change production agents.&lt;/P&gt;
&lt;P&gt;Until now, customers have typically exported analytics for offline analysis or built custom reporting outside Copilot Studio. With the Analytics Viewer sharing role, agent owners can share analytics safely—without expanding edit permissions.&lt;/P&gt;
&lt;P&gt;Not only does this expand access to insight while maintain proper oversight, it removes a key point of friction, helping teams keep tools streamlined and reduce time to information:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Stakeholders can access analytics directly, on demand&lt;/STRONG&gt;, without relying on exports or intermediaries.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Teams can grant visibility more broadly&lt;/STRONG&gt;, with permissions scoped appropriately to each role.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Business and product stakeholders can monitor performance that updates after every use&lt;/STRONG&gt;, helping them make quicker decisions.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;This shift allows analytics to function as a shared, operational capability across teams, while allowing IT and admins to maintain confidence that governance standards are upheld.&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;STRONG&gt;&lt;EM&gt;“&lt;/EM&gt;&lt;/STRONG&gt;&lt;EM&gt;The Analytics Viewer role allows us to provide meaningful performance insights to business and operational stakeholders while maintaining strict production governance. It cleanly separates operational visibility from agent configuration and publishing rights.&lt;STRONG&gt;”&lt;/STRONG&gt;&lt;/EM&gt;&lt;/DIV&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;
&lt;P&gt;&lt;STRONG&gt;Mohamed Arhab&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;Solution Architect, City of Montreal&amp;nbsp;&lt;/EM&gt;&lt;/P&gt;
&lt;/DIV&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/DIV&gt;
&lt;/BLOCKQUOTE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Learn more about the Analytics Viewer role&amp;nbsp;&lt;A href="https://learn.microsoft.com/microsoft-copilot-studio/admin-share-bots?tabs=web#share-an-agents-analytics" target="_blank" rel="noopener"&gt;here&lt;/A&gt;.&lt;/P&gt;
&lt;H2&gt;Custom metrics: measure the outcomes your business cares about&lt;/H2&gt;
&lt;P&gt;As agents become part of business workflows, analytics must move from usage to impact. Standard metrics show activity but don’t answer the key question: &lt;STRONG&gt;did the agent achieve the desired business outcome?&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-custom-metrics" target="_blank" rel="noopener"&gt;Custom metrics in Copilot Studio&lt;/A&gt; shift analytics from activity to business outcomes - helping you understand not just usage, but the results agents deliver. They enable evaluation in business terms, supporting ROI, decision making, and stakeholder alignment.&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3&gt;Define success in your own terms&lt;/H3&gt;
&lt;P&gt;With custom metrics, now available in public preview, you define success in terms your business understands. Custom metrics are calculated by analyzing agent conversations.&lt;/P&gt;
&lt;P&gt;You describe:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;The outcome you want to measure.&lt;/LI&gt;
&lt;LI&gt;The set of allowed result categories that represent that outcome.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;These results are classification-based. For example, outcomes might be:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Ticket deflected&lt;/STRONG&gt; or &lt;STRONG&gt;Not deflected&lt;/STRONG&gt;.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Goal completed correctly&lt;/STRONG&gt;, &lt;STRONG&gt;Completed with issues&lt;/STRONG&gt;, or &lt;STRONG&gt;Not completed&lt;/STRONG&gt;.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Converted&lt;/STRONG&gt; or &lt;STRONG&gt;Not converted&lt;/STRONG&gt;.&lt;/LI&gt;
&lt;/UL&gt;
&lt;img /&gt;
&lt;P&gt;Based on your definition, Copilot Studio automatically generates the prompt that will be used to evaluate agent conversations. This helps keep evaluations consistent, while maintaining the focus on &lt;EM&gt;business meaning&lt;/EM&gt; rather than &lt;EM&gt;technical implementation&lt;/EM&gt;.&lt;/P&gt;
&lt;H3&gt;Clear, actionable results provided by custom metrics&lt;/H3&gt;
&lt;P&gt;Each conversation with an agent maps to exactly one outcome category that you define for it. This classification-based approach makes results:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Easier to interpret&lt;/STRONG&gt;, because outcomes are explicit.&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Easier to explain to stakeholders&lt;/STRONG&gt;, because they map to business language.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Easier to act on&lt;/STRONG&gt;, because teams can see clear outcome distributions rather than abstract scores.&lt;/LI&gt;
&lt;/UL&gt;
&lt;img /&gt;
&lt;P&gt;As examples of clear, actionable results, consider the sample use cases from before:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Ticket deflection:&lt;/STRONG&gt; Understand how effectively (how often, by percentage) an agent resolves issues without escalating to human support.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Goal completion quality:&lt;/STRONG&gt; Evaluate whether users are achieving the intended business outcome successfully (yes/no).&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Sales conversion:&lt;/STRONG&gt; Measure how often interactions lead to a qualified outcome or purchase (by percentage or number).&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;When used thoughtfully, these outcome classifications become a shared language across teams, helping organizations operate agents with greater confidence and clarity as they scale.&lt;/P&gt;
&lt;H3&gt;How custom metrics help teams operate more effectively&lt;/H3&gt;
&lt;P&gt;Custom metrics shift agent Analytics pages from being purely a reporting tool to becoming an &lt;STRONG&gt;operational decision-making tool&lt;/STRONG&gt;.&lt;/P&gt;
&lt;P&gt;Teams can align early on what “good” looks like, using a common definition of success across product and business stakeholders. When performance is evaluated based on outcome distributions—not isolated signals—it’s easier to understand what’s working and what isn’t in your agents.&lt;/P&gt;
&lt;P&gt;Over time, this helps facilitate a more intentional approach to agent fine-tuning and iteration:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Teams can see how changes to an agent affect real outcomes.&lt;/LI&gt;
&lt;LI&gt;Decisions can be tied directly to business impact.&lt;/LI&gt;
&lt;LI&gt;Leaders can use measurable results to prioritize investment decisions.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;As agents take on more responsibility, this outcome-first model becomes essential for scale because teams need a clear, consistent way to &lt;STRONG&gt;measure impact, align decisions, and confidently expand&lt;/STRONG&gt; what agents are trusted to do.&lt;/P&gt;
&lt;P&gt;Learn more about custom metrics, now in preview, &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-custom-metrics" target="_blank" rel="noopener"&gt;here&lt;/A&gt;.&lt;/P&gt;
&lt;H2&gt;The right insights for the right people&lt;/H2&gt;
&lt;P&gt;Individually, the Analytics Viewer role and custom metrics solve different challenges. Together, they help reshape how teams work with analytics and create more value. Analytics Viewer makes it possible to share analytics safely, without expanding access beyond what’s secure and sensible. Custom metrics help ensure your business can measure what you care about.&lt;/P&gt;
&lt;P&gt;The result is a better balance between &lt;STRONG&gt;insight and oversight&lt;/STRONG&gt;—a fulcrum where visibility can scale across teams without introducing new governance risks. As agents become a core element of business workflows, this combination helps organizations measure impact, demonstrate ROI, and operate agents with greater accuracy and confidence to expand.&lt;/P&gt;
&lt;P&gt;That’s the direction Microsoft Copilot Studio continues to move toward: helping organizations not just build and deploy agents, but analyze them with clarity, confidence, and control at scale.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Explore &lt;/STRONG&gt;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/admin-share-bots?tabs=web#share-an-agents-analytics" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Analytics Viewer&lt;/STRONG&gt;&lt;/A&gt;&lt;STRONG&gt; and &lt;/STRONG&gt;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-custom-metrics" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;custom metrics&lt;/STRONG&gt;&lt;/A&gt;&lt;STRONG&gt; in Copilot Studio today&lt;/STRONG&gt; to start bringing the right metrics to the right people—at the right level of access.&lt;/P&gt;</description>
      <pubDate>Tue, 05 May 2026 20:25:38 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/read-only-analytics-access-and-custom-metrics-now-available-in/ba-p/4516824</guid>
      <dc:creator>Eran_Manor</dc:creator>
      <dc:date>2026-05-05T20:25:38Z</dc:date>
    </item>
    <item>
      <title>Work IQ API public preview: Build Copilot powered agents with A2A</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/work-iq-api-public-preview-build-copilot-powered-agents-with-a2a/ba-p/4516286</link>
      <description>&lt;P&gt;Today, the Work IQ API is available in public preview, expanding access to the same underlying intelligence capabilities used by Microsoft 365 Copilot for developers and partners.&lt;/P&gt;
&lt;P&gt;Work IQ gives developers programmatic access to Copilot so you can build agents and apps that use Copilot’s grounding, context, and reasoning without wiring raw data or permissions. In this public preview, that access is exposed in three formats—Agent‑to‑Agent (A2A) for agent collaboration, Model Context Protocol (MCP) for IDEs and tools, and REST for app integrations—letting you choose the right surface for your scenario while relying on the same intelligence runtime.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://techcommunity.microsoft.com/blog/microsoft365copilotblog/a-closer-look-at-work-iq/4499789" target="_blank" rel="noopener"&gt;Work IQ&lt;/A&gt; continuously builds context through memory and a semantic index across organizational data, Microsoft 365, line‑of‑business systems, and external sources. &amp;nbsp;The result is a unified foundation for building agents, applications, and workflows grounded in how work actually happens, with enterprise‑grade security and governance built in by default.&lt;/P&gt;
&lt;P&gt;With the Work IQ API, developers build on &lt;STRONG&gt;permission&lt;/STRONG&gt;‑&lt;STRONG&gt;aware intelligence&lt;/STRONG&gt;—context, intent, signals, and skills—so applications and agents understand what’s in motion, what’s been decided, and what needs attention next.&lt;/P&gt;
&lt;P&gt;For developers, this means:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;No raw data wiring:&lt;/STRONG&gt; Work with intelligence instead of directly accessing emails, files, or chats.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;No orchestration overhead:&lt;/STRONG&gt; A single runtime handles context assembly, grounding, skill selection, and tool invocation.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;No security rework&lt;/STRONG&gt;: Enterprise compliance, governance, and access controls are inherited automatically.&lt;/LI&gt;
&lt;/OL&gt;
&lt;H2&gt;Where Work IQ API is available&lt;/H2&gt;
&lt;P&gt;Work IQ API meets developers where they are, supporting four protocols that share the same underlying intelligence runtime. This means that the behavior, grounding, and response quality stay consistent across every surface.&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;Model Context Protocol (MCP) — local server&lt;/STRONG&gt;&lt;STRONG&gt; &lt;/STRONG&gt;(available now): Works with &lt;A href="https://developer.microsoft.com/blog/bringing-work-context-to-your-code-in-github-copilot" target="_blank" rel="noopener"&gt;GitHub Copilot through the Work IQ CLI&lt;/A&gt;, giving MCP-compliant agents governed access to organizational context as tools and resources directly from their IDE or CLI workflow.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Agent-to-Agent (A2A),&lt;/STRONG&gt; new in this public preview: A cloud-hosted protocol that enables custom agents to interact directly with Copilot intelligence as a peer—delegating work, receiving grounded responses, and maintaining context across interactions.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;REST&lt;/STRONG&gt; (coming May 2026): Standard request/response access to Work IQ intelligence for applications and integrations that don't require streaming or agent-to-agent coordination.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Model Context Protocol (MCP) — remote server&lt;/STRONG&gt; (coming May 2026): A hosted MCP server exposing a standard set of tools and skills for interacting with M365 data, so any MCP-aware surface can connect to Work IQ without running a local server.&lt;/LI&gt;
&lt;/OL&gt;
&lt;img /&gt;
&lt;H2&gt;What’s new: Agent-to-Agent (A2A) protocol&lt;/H2&gt;
&lt;P&gt;This public preview introduces the &lt;A href="https://a2a-protocol.org/latest/" target="_blank" rel="noopener"&gt;Agent‑to‑Agent (A2A)&lt;/A&gt; protocol, which allows custom‑built agents to interact directly with the intelligence layer behind Copilot. Over time, A2A will expand to support collaboration across a broader set of agents and tools that support A2A or MCP, meaning agents can work together with specialized, organization‑aware capabilities.&lt;/P&gt;
&lt;P&gt;With A2A, agents communicate as peers. Rather than calling a traditional API and parsing a response, an agent can delegate work to another agent running on Work IQ—an agent with access (subject to the user’s and tenant’s permissions) to organizational context across emails, meetings, files, and conversations—and receive responses that are grounded in enterprise data and governed by the same security and compliance policies.&lt;/P&gt;
&lt;P&gt;Here’s what that looks like in practice: a single A2A message sent to a Copilot agent using standard .NET A2A SDK:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;PRE&gt;var&amp;nbsp;credential&amp;nbsp;=&amp;nbsp;new&amp;nbsp;InteractiveBrowserCredential(clientId);&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; var&amp;nbsp;token&amp;nbsp;=&amp;nbsp;await&amp;nbsp;credential.GetTokenAsync(new&amp;nbsp;(["api://workiq.svc.cloud.microsoft/WorkIQAgent.Ask"]));&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; var&amp;nbsp;httpClient&amp;nbsp;=&amp;nbsp;new&amp;nbsp;HttpClient();&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;httpClient.DefaultRequestHeaders.Authorization&amp;nbsp;=&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; new&amp;nbsp;AuthenticationHeaderValue("Bearer",&amp;nbsp;token.Token);&lt;BR /&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; //&amp;nbsp;Create&amp;nbsp;an&amp;nbsp;A2A&amp;nbsp;client&amp;nbsp;that&amp;nbsp;talks&amp;nbsp;to&amp;nbsp;the&amp;nbsp;default&amp;nbsp;WorkIQ&amp;nbsp;agent&amp;nbsp;endpoint,&amp;nbsp;using&amp;nbsp;the&amp;nbsp;authenticated&amp;nbsp;HttpClient&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; var&amp;nbsp;client&amp;nbsp;=&amp;nbsp;new&amp;nbsp;A2AClient(new&amp;nbsp;Uri("https://workiq.svc.cloud.microsoft/a2a/"),&amp;nbsp;httpClient);&lt;BR /&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; //&amp;nbsp;Compose&amp;nbsp;a&amp;nbsp;query&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; var&amp;nbsp;question&amp;nbsp;=&amp;nbsp;"What&amp;nbsp;meetings&amp;nbsp;do&amp;nbsp;I&amp;nbsp;have&amp;nbsp;tomorrow&amp;nbsp;that&amp;nbsp;I&amp;nbsp;need&amp;nbsp;to&amp;nbsp;prepare&amp;nbsp;for?";&lt;BR /&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; //&amp;nbsp;Send&amp;nbsp;the&amp;nbsp;query&amp;nbsp;and&amp;nbsp;WorkIQ&amp;nbsp;returns&amp;nbsp;an&amp;nbsp;A2A&amp;nbsp;task&amp;nbsp;that&amp;nbsp;when&amp;nbsp;complete&amp;nbsp;will&amp;nbsp;contain&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; //&amp;nbsp;&amp;nbsp;a&amp;nbsp;grounded,&amp;nbsp;permission-trimmed&amp;nbsp;answer&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; var&amp;nbsp;response&amp;nbsp;=&amp;nbsp;await&amp;nbsp;client.SendMessageAsync(new&amp;nbsp;SendMessageRequest&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;{&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Message&amp;nbsp;=&amp;nbsp;new&amp;nbsp;Message&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;{&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;MessageId&amp;nbsp;=&amp;nbsp;Guid.NewGuid().ToString("N"),&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Role&amp;nbsp;=&amp;nbsp;Role.User,&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;Parts&amp;nbsp;=&amp;nbsp;[Part.FromText(question)]&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;}&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;});&lt;/PRE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Developers can rely on Work IQ to handle common retrieval and permission-enforcement tasks, reducing the need to build and maintain custom pipelines. The full working samples (C#, Rust, and Swift) are available &lt;A href="https://github.com/microsoft/work-iq-samples" target="_blank" rel="noopener"&gt;here&lt;/A&gt;.&lt;/P&gt;
&lt;H3&gt;What A2A unlocks for developers&lt;/H3&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Multi-agent collaboration&lt;/STRONG&gt;: Your agents delegate to Copilot agents that understand organizational context.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Embedded intelligence via agents&lt;/STRONG&gt;&lt;STRONG&gt; &lt;/STRONG&gt;inside SaaS products that tap into a customer’s real work context.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Enterprise assistants&lt;/STRONG&gt; tailored to specific roles, powered by Copilot’s grounding and reasoning.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Autonomous workflows&lt;/STRONG&gt; where agents hand off tasks, track progress, and exchange structured artifacts.&lt;/LI&gt;
&lt;/UL&gt;
&lt;H2&gt;See A2A in action&lt;/H2&gt;
&lt;DIV style="position: relative; width: 100%; padding-bottom: 56.25%; height: 0; overflow: hidden;"&gt;&lt;IFRAME src="https://medius.microsoft.com/Embed/video-nc/7296c474-5d3b-44af-9860-dc213f74cc23?r=776257266850" title="Work IQ A2A" allowfullscreen="allowfullscreen" frameborder="0" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%;" sandbox="allow-scripts allow-same-origin allow-forms"&gt;&lt;/IFRAME&gt;&lt;/DIV&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The demo above shows an agent sending a natural-language request to Copilot via A2A and receiving a streamed, grounded response—backed by the user’s real organizational data, with no custom retrieval or permissions code. Try it yourself with the &lt;A href="https://github.com/microsoft/work-iq-samples" target="_blank" rel="noopener"&gt;Work IQ samples&lt;/A&gt; on GitHub.&lt;/P&gt;
&lt;H2&gt;How developers can use Work IQ API&lt;/H2&gt;
&lt;P&gt;Whether you’re connecting agents via A2A, surfacing organizational context through MCP in developer surfaces like IDEs and CLIs, or embedding conversational experiences via REST, the patterns share a common foundation.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;A2A &lt;/STRONG&gt;is the natural fit when your agent needs to collaborate with Copilot intelligence as a peer—delegating work, receiving structured results, and maintaining context across interactions. For example, a sales enablement agent could ask to pull together a customer's recent email threads, meeting notes, and shared documents, then use that grounded summary to auto-generate a pre-call brief—without ever touching the raw data itself.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;MCP &lt;/STRONG&gt;is ideal when you need to expose Work IQ data as tools and resources to an existing MCP-compliant agent using standard protocols and tooling, without needing custom, one-off connections. For example, with the &lt;A class="lia-external-url" href="https://learn.microsoft.com/en-us/microsoft-365/copilot/extensibility/workiq-overview" target="_blank" rel="noopener"&gt;Work IQ CLI&lt;/A&gt; running as a local MCP server, a developer using GitHub Copilot can ask, "What did the team decide about the migration timeline?" and get an answer grounded in actual emails, meeting transcripts, and chat threads—right inside their IDE, without leaving their coding workflow.&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;The common thread across these approaches is a shift from isolated AI features to agent‑driven systems that stay aligned with how work actually unfolds.&lt;/P&gt;
&lt;H2&gt;Security and governance&lt;/H2&gt;
&lt;P&gt;Developers can rely on Work IQ for common security and compliance controls. The Work IQ API draws a clear platform boundary between data access and intelligence:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Permissions&lt;/STRONG&gt;: User and tenant permissions, conditional access policies, and sensitivity labels are automatically enforced.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Responses&lt;/STRONG&gt;: Permission-trimmed by design.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Compliance:&lt;/STRONG&gt; All activity operates within the same audit, compliance, and data-loss-prevention boundaries as Microsoft 365.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Because the API exposes intelligence rather than raw data, applications cannot accidentally bypass tenant security or create shadow-AI risks. Agents operate within the tenant’s existing security and compliance controls from day one.&lt;/P&gt;
&lt;H2&gt;What comes next for Work IQ&lt;/H2&gt;
&lt;P&gt;Users with the appropriate license will be able to access Work IQ in customer and partner apps and agents starting today. &amp;nbsp;&lt;/P&gt;
&lt;P&gt;General availability is planned for summer 2026. The Work IQ API will also be available to unlicensed users on a consumption basis in summer 2026.&lt;/P&gt;
&lt;P&gt;We will continue to add more to the Work IQ API in the coming weeks and months. Our investments will target three areas:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;M365 Agent access,&lt;/STRONG&gt; enabling more M365 agents to work with and alongside agents across the Copilot ecosystem, unlocking richer multi-agent collaboration patterns.&lt;/LI&gt;
&lt;LI&gt;Remote &lt;STRONG&gt;MCP server public preview,&lt;/STRONG&gt; exposing Work IQ as tools and related skills for MCP-compliant agents, so any MCP-aware surface can access governed organizational context.&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Deeper connection and &lt;/STRONG&gt;&lt;STRONG&gt;richer intelligence&lt;/STRONG&gt;, expanding across platforms and developer tools in addition to building on top of organizational data with contextual grounding, skills, and tools.&lt;/LI&gt;
&lt;/OL&gt;
&lt;H2&gt;Resources&lt;/H2&gt;
&lt;P&gt;Building with the Work IQ API and have questions? Join our developer communities for support:&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;Microsoft Q&amp;amp;A:&lt;/STRONG&gt; &lt;A href="https://learn.microsoft.com/en-us/answers/tags/466/microsoft-copilot-microsoft-365-copilot-development-routing" target="_blank" rel="noopener"&gt;Microsoft Copilot | Microsoft 365 Copilot | Development&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;GitHub:&lt;/STRONG&gt; Work IQ GitHub repository: &lt;A href="https://github.com/microsoft/work-iq" target="_blank" rel="noopener"&gt;microsoft/work-iq: MCP Server and CLI for accessing Work IQ&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Reddit: &lt;/STRONG&gt;&lt;A href="https://www.reddit.com/r/copilotstudio/" target="_blank" rel="noopener"&gt;copilotstudio&lt;/A&gt; or &lt;A href="https://www.reddit.com/r/microsoft_365_copilot/" target="_blank" rel="noopener"&gt;microsoft_365_copilot&lt;/A&gt; subreddits&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Documentation&lt;/STRONG&gt;: &lt;A href="https://learn.microsoft.com/en-us/microsoft-365/copilot/extensibility/work-iq-api-overview" target="_blank" rel="noopener"&gt;Microsoft Work IQ API (preview) | Microsoft Learn&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Samples&lt;/STRONG&gt;: &lt;A href="https://github.com/microsoft/work-iq-samples" target="_blank" rel="noopener"&gt;https://github.com/microsoft/work-iq-samples&lt;/A&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We can’t wait to see what you build.&lt;/P&gt;</description>
      <pubDate>Mon, 11 May 2026 19:44:43 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/work-iq-api-public-preview-build-copilot-powered-agents-with-a2a/ba-p/4516286</guid>
      <dc:creator>tolgaki</dc:creator>
      <dc:date>2026-05-11T19:44:43Z</dc:date>
    </item>
    <item>
      <title>Automate agent evaluation with the Evaluation APIs</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/automate-agent-evaluation-with-the-evaluation-apis/ba-p/4511653</link>
      <description>&lt;P&gt;When you build an agent in &lt;A href="https://www.microsoft.com/en-us/microsoft-365-copilot/microsoft-copilot-studio" target="_blank" rel="noopener"&gt;Microsoft Copilot Studio&lt;/A&gt;, you want confidence that it behaves exactly as intended: answering correctly, using the right tools, and following the logic you designed. &lt;A href="https://techcommunity.microsoft.com/blog/copilot-studio-blog/agent-evaluation-in-microsoft-copilot-studio-is-now-generally-available/4507392" target="_blank" rel="noopener"&gt;Agent Evaluation&lt;/A&gt; (generally available) provides this foundation by allowing you to define test sets, run them against your agent, and understand how it performs.&lt;/P&gt;
&lt;P&gt;As agents evolve from experimentation into real production scenarios, this foundation becomes part of an ongoing process. Evaluation is no longer a one-time step, but a continuous part of the development lifecycle. Teams are looking to validate changes quickly, track quality over time, and ensure consistent behavior across updates, environments, and use cases.&lt;/P&gt;
&lt;P&gt;To support this, evaluation scales alongside your agents. Automated evaluation enables teams to expand their testing coverage, run evaluations more frequently, and establish consistent quality signals across the lifecycle. It brings evaluation closer to the way modern systems are built: iterative, data-driven, and continuously improving.&lt;/P&gt;
&lt;P&gt;To fully realize this at scale, evaluation integrates seamlessly into your workflows and systems.&lt;/P&gt;
&lt;P&gt;Now, these same evaluation capabilities can be used programmatically through &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-agent-evaluation-rest-api" target="_blank"&gt;Power Platform REST API&lt;/A&gt; and your &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-agent-evaluation-automate-tools" target="_blank"&gt;connectors&lt;/A&gt;. Here’s how you can use these Evaluation APIs to automate agent evaluation &lt;U&gt;as part of your development and release &lt;/U&gt;workflows.&lt;/P&gt;
&lt;H2&gt;What you can do with the Evaluation APIs&lt;/H2&gt;
&lt;P&gt;The Evaluation APIs expose the core evaluation experience as programmable endpoints. Using those endpoints, you can trigger evaluations on demand, integrate evalutaions into pipelines and approval workflows, and design processes relying on the results. Whether you prefer a code-first approach with APIs or a low-code experience using &lt;A href="https://learn.microsoft.com/en-us/power-automate/flow-types" target="_blank"&gt;Microsoft Power Automate flows&lt;/A&gt; and &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/flows-overview" target="_blank"&gt;Copilot Studio agent workflows&lt;/A&gt;, you can easily automate when and how evaluations run – and use the results for quality gateway.&lt;/P&gt;
&lt;P&gt;Here are the capabilities included in the Maker Evaluation API:&lt;/P&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="width: 740px; border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Capability&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;What it does&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;List test sets&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Retrieve the test sets configured for your agent&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Run a test set&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Trigger a test set to execute against your agent&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Poll run status&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Poll a running evaluation to see when it completes&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Retrieve results&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Retrieve detailed results including per-test-case scores&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;List historical runs&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;List all previous evaluation runs for reporting or comparison&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 50.00%" /&gt;&lt;col style="width: 50.00%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;P&gt;These APIs work with any HTTP client, Python scripts, Azure DevOps pipelines, GitHub Actions, or custom tooling. For teams working in the Power Platform ecosystem, the same actions are available through the &lt;A href="https://learn.microsoft.com/en-us/connectors/custom-connectors/submit-certification" target="_blank" rel="noopener"&gt;Microsoft Copilot Studio certified connector&lt;/A&gt;, which integrates directly with Power Automate flows.&lt;/P&gt;
&lt;H2&gt;When to use Evaluation APIs&lt;/H2&gt;
&lt;P&gt;The Evaluation APIs exist so you can run evaluations without manually triggering them,&amp;nbsp; letting evaluation happen automatically as part of your pipelines, your flows, or your own tools. &lt;U&gt;By default, runs evaluate the agent’s unpublished (draft) version, which makes this especially useful for CI/CD and pre-publish validation. &lt;/U&gt;The Copilot Studio UI is still the right place for one-off, interactive evaluation. Reach for the APIs when you want evaluation to happen on its own.&lt;/P&gt;
&lt;P&gt;Here are three common scenarios.&lt;/P&gt;
&lt;H3&gt;1.&lt;STRONG&gt; &lt;/STRONG&gt;Add evaluation to your CI/CD pipeline&lt;/H3&gt;
&lt;P&gt;When your agent source lives in a repository, every pull request and every merge to main is an opportunity to validate quality before changes reach production. Wire the Evaluation APIs into Azure DevOps, GitHub Actions, or any CI runner: each pipeline run triggers an evaluation, waits for the result, and passes or fails the build based on the score. Quality regressions are caught at PR time, not in production.&lt;/P&gt;
&lt;H3&gt;2. Trigger evaluation from a Power Automate flow&lt;/H3&gt;
&lt;P&gt;Many events that may affect agent quality happen outside Copilot Studio: a knowledge source is updated in SharePoint, a new article is added to a file library, a Dataverse record changes agent behavior. Use Power Automate (with the Microsoft Copilot Studio certified connector) to listen for these events and kick off an evaluation test run automatically, then route the results to Teams, email, or whichever channel your team watches.&lt;/P&gt;
&lt;H3&gt;3.&lt;STRONG&gt; &lt;/STRONG&gt;Embed evaluation in your own tools&lt;/H3&gt;
&lt;P&gt;Sometimes you want evaluation as part of a tool you’re already building: a Center of Excellence dashboard tracking quality across many agents, an admin script that confirms every new agent has been evaluated before publish, or a custom integration that adds evaluation to an existing approval workflow. The APIs let you call evaluation programmatically from any system, with whatever logic fits your scenario.&lt;/P&gt;
&lt;H2&gt;How an evaluation run works through the API&lt;/H2&gt;
&lt;P&gt;The evaluation flow follows a simple pattern: &lt;STRONG&gt;Trigger&lt;/STRONG&gt; → &lt;STRONG&gt;Poll&lt;/STRONG&gt; → &lt;STRONG&gt;Get Results.&lt;/STRONG&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;&lt;STRONG&gt;Trigger: &lt;/STRONG&gt;Send a POST request to start an evaluation run for a specific test set&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Poll: &lt;/STRONG&gt;Check the run status until it completes (the execution is asynchronous)&lt;/LI&gt;
&lt;LI&gt;&lt;STRONG&gt;Get results: &lt;/STRONG&gt;Retrieve the score and detailed per-test-case outcomes&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;Optionally, you can pass an &lt;STRONG&gt;MCS Connection ID&lt;/STRONG&gt; when triggering a run. This allows the evaluation to run using an authenticated user context, enabling access to tools and knowledge sources that require authentication. Without it, the evaluation will run anonymously.&lt;/P&gt;
&lt;H2&gt;Working with the Evaluation APIs: the key endpoints&lt;/H2&gt;
&lt;P&gt;Below are the core Evaluation API endpoints available today, starting with how to retrieve test sets and trigger evaluation runs programmatically.&lt;/P&gt;
&lt;H2&gt;Prerequisites&lt;/H2&gt;
&lt;P&gt;API Permissions.&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Go to &lt;A href="https://portal.azure.com" target="_blank"&gt;https://portal.azure.com&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Go to App Registrations&lt;/LI&gt;
&lt;LI&gt;Search for your App&lt;/LI&gt;
&lt;LI&gt;Click API permissions&lt;/LI&gt;
&lt;LI&gt;Click Add a permission&lt;/LI&gt;
&lt;LI&gt;Click APIs my organization uses&lt;/LI&gt;
&lt;LI&gt;Search "Power Platform API"&lt;/LI&gt;
&lt;LI&gt;Click Delegated permissions&lt;/LI&gt;
&lt;LI&gt;Expand CopilotStudio&lt;/LI&gt;
&lt;LI&gt;Select MakerOperations.Read, MakerOperations.ReadWrite&lt;/LI&gt;
&lt;LI&gt;Click Add Permissions&lt;/LI&gt;
&lt;/OL&gt;
&lt;img /&gt;&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3&gt;Endpoint 1: Retrieve available test sets&lt;/H3&gt;
&lt;P&gt;Use this endpoint to list all evaluation test sets defined for a specific agent.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Request:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;GET &lt;A href="https://api.test.powerplatform.com/copilotstudio/environments/%7byourEnvironment%7d/bots/%7breplaceWithYourCdsBotId%7d/api/makerevaluation/testsets?api-version=1" target="_blank" rel="noopener"&gt;https://api.powerplatform.com/copilotstudio/environments/{yourEnvironment}/bots/{replaceWithYourCdsBotId}/api/makerevaluation/testsets?api-version=1&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Expected result:&lt;/STRONG&gt;&lt;BR /&gt;Returns the list of maker evaluation test sets associated with the agent.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Sample response:&lt;/STRONG&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;H3&gt;Endpoint 2: Retrieve a specific test set&lt;/H3&gt;
&lt;P&gt;Once you have a test set ID, you can fetch its full definition.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Request&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;GET &lt;A href="https://api.test.powerplatform.com/copilotstudio/environments/%7byourEnvironment%7d/bots/%7breplaceWithYourCdsBotId%7d/api/makerevaluation/testsets/%7byourTestSetId%7d?api-version=1" target="_blank" rel="noopener"&gt;https://api.powerplatform.com/copilotstudio/environments/{yourEnvironment}/bots/{replaceWithYourCdsBotId}/api/makerevaluation/testsets/{yourTestSetId}?api-version=1&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Expected result&lt;/STRONG&gt;&lt;BR /&gt;Returns the full configuration and structure of the selected test set.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Sample response:&lt;/STRONG&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;H3&gt;End point 3: Trigger an evaluation run&lt;/H3&gt;
&lt;P&gt;This endpoint allows you to programmatically start an evaluation run for a given test set.&lt;/P&gt;
&lt;P&gt;The Body consists of a JSON object with the following attributes:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;McsConnectionId&lt;/STRONG&gt; - string value. If an empty string is provided, the evaluation runs anonymously, meaning tools and knowledge sources are not used. Agents that rely on authenticated connectors, actions, or auth‑gated knowledge sources will therefore produce different (likely worse) evaluation results.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;RunOnPublishedBot&lt;/STRONG&gt; - optional boolean value, defaults to false. Runs against the draft version (true runs against the published version).&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;EvaluationRunName&lt;/STRONG&gt; - optional string value, useful for naming runs in dashboards.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Request&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;POST &lt;A href="https://api.test.powerplatform.com/copilotstudio/environments/%7byourEnvironment%7d/bots/%7breplaceWithYourCdsBotId%7d/api/makerevaluation/testsets/%7byourTestSetId%7d/run?api-version=1" target="_blank" rel="noopener"&gt;https://api.powerplatform.com/copilotstudio/environments/{yourEnvironment}/bots/{replaceWithYourCdsBotId}/api/makerevaluation/testsets/{yourTestSetId}/run?api-version=1&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Body&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;{&lt;/P&gt;
&lt;P&gt;“RunOnPublishedBot”: {boolean value},&lt;/P&gt;
&lt;P&gt;"mcsConnectionId": "{yourMCSConnectionId}",&lt;/P&gt;
&lt;P&gt;“evaluationRunName”: “{yourEvaluationRunName}”,{&lt;/P&gt;
&lt;P&gt;}&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Sample request:&lt;/STRONG&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&lt;STRONG&gt;Sample response:&lt;/STRONG&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&lt;STRONG&gt;Removed the note&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;How to obtain mcsConnectionId&lt;/STRONG&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI&gt;Go to: &lt;A href="https://make.powerautomate.com" target="_blank" rel="noopener"&gt;https://make.powerautomate.com&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;Open &lt;STRONG&gt;Connections&lt;/STRONG&gt; from the side menu&lt;/LI&gt;
&lt;LI&gt;Select the relevant &lt;STRONG&gt;Microsoft Copilot Studio&lt;/STRONG&gt; connection&lt;/LI&gt;
&lt;LI&gt;Copy the connection ID from the URL&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;This connection ID will look something like:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://make.powerautomate.com/environments/Default-00000000-0000-0000-0000-000000000000/connections/shared_microsoftcopilotstudio/shared-microsoftcopi-xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx/details" target="_blank" rel="noopener"&gt;https://make.powerautomate.com/environments/Default-00000000-0000-0000-0000-000000000000/connections/shared_microsoftcopilotstudio/shared-microsoftcopi-xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx/details&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Note: One run at a time&lt;/STRONG&gt;&lt;BR /&gt;&amp;nbsp;The API returns HTTP 422 if you try to start a run while another is already in progress for the same agent.&lt;/P&gt;
&lt;H3&gt;Endpoint 4: Get evaluation run status and results&lt;/H3&gt;
&lt;P&gt;After triggering a run, use the returned run ID to retrieve status and results.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Request&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;GET &lt;A href="https://api.test.powerplatform.com/copilotstudio/environments/%7byourEnvironment%7d/bots/%7byourCdsBotId%7d/api/makerevaluation/testruns/%7byourTestRunId%7d?api-version=1" target="_blank" rel="noopener"&gt;https://api.powerplatform.com/copilotstudio/environments/{yourEnvironment}/bots/{yourCdsBotId}/api/makerevaluation/testruns/{yourTestRunId}?api-version=1&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Expected result&lt;/STRONG&gt;&lt;BR /&gt;Returns the status and once completed, the evaluation results.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Sample response:&lt;/STRONG&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;H3&gt;End point 5: List previous evaluation runs&lt;/H3&gt;
&lt;P&gt;This endpoint is useful for tracking trends, building dashboards, and supporting automated decision logic.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Request&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;GET &lt;A href="https://api.test.powerplatform.com/copilotstudio/environments/%7byourEnvironment%7d/bots/%7byourCdsBotId%7d/api/makerevaluation/testruns?api-version=1" target="_blank" rel="noopener"&gt;https://api.powerplatform.com/copilotstudio/environments/{yourEnvironment}/bots/{yourCdsBotId}/api/makerevaluation/testruns?api-version=1&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Expected result&lt;/STRONG&gt;&lt;BR /&gt;Returns an array of previous evaluation runs, each with the same schema as the run details API.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Sample response:&lt;/STRONG&gt;&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H2&gt;Start using the Evaluation APIs today&lt;/H2&gt;
&lt;P&gt;Pick a test set, call the API, and see what your agent scores. That first run gives you a baseline. From there, you can automate evaluations into your workflow, set thresholds, and build the checks that make sense for your team. The APIs are available now. Start simple, and build from there.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://go.microsoft.com/fwlink/p/?linkid=2252408&amp;amp;clcid=0x409&amp;amp;culture=en-us&amp;amp;country=us" target="_blank" rel="noopener"&gt;Sign into Copilot Studio&lt;/A&gt; to get started today.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Apr 2026 18:34:14 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/automate-agent-evaluation-with-the-evaluation-apis/ba-p/4511653</guid>
      <dc:creator>Efrat_Gilboa</dc:creator>
      <dc:date>2026-04-29T18:34:14Z</dc:date>
    </item>
    <item>
      <title>Hello, World - Welcome to the Copilot Studio Blog!</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/hello-world-welcome-to-the-copilot-studio-blog/ba-p/4509681</link>
      <description>&lt;P&gt;We’re so excited you’re here.&lt;/P&gt;
&lt;P&gt;Today marks the launch of the &lt;A class="lia-external-url" href="https://aka.ms/MCSblog" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Copilot Studio Tech Community Blog&lt;/STRONG&gt;&lt;/A&gt;, a space for the builders and admins shaping the agent era in the real world.&lt;/P&gt;
&lt;P&gt;Agents are moving from demos to production, so we’ll focus on practical patterns for building, shipping, and governing at scale, beyond what docs and product announcements cover. Makers will find templates and build tactics; IT and security will get governance guidance; developers will get deeper dives on extensibility and production operations.&lt;/P&gt;
&lt;BLOCKQUOTE&gt;
&lt;P&gt;Hit &lt;STRONG&gt;Follow&lt;/STRONG&gt; at the top of the page and introduce yourself in the &lt;A class="lia-external-url" href="https://aka.ms/MCSdiscussions" target="_blank" rel="noopener"&gt;discussion &lt;/A&gt;forum,&amp;nbsp;with what you’re building&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;
&lt;H2&gt;What Is Microsoft Copilot Studio?&lt;/H2&gt;
&lt;P&gt;Microsoft Copilot Studio is Microsoft’s platform for building and governing AI agents across the enterprise, from prototyping to production. For the full product overview and getting-started guidance, &lt;A href="https://microsoft.com/microsoft-copilot/microsoft-copilot-studio" target="_blank" rel="noopener"&gt;visit the Copilot Studio website&lt;/A&gt;.&lt;/P&gt;
&lt;H3&gt;What’s New in Copilot Studio&lt;/H3&gt;
&lt;P&gt;We’re not starting this blog quietly. Here’s a look at three of the biggest updates that have shipped recently.&lt;/P&gt;
&lt;H4&gt;1. &lt;A class="lia-internal-link lia-internal-url lia-internal-url-content-type-blog" href="https://techcommunity.microsoft.com/blog/copilot-studio-blog/agent-evaluation-in-microsoft-copilot-studio-is-now-generally-available/4507392" target="_blank" rel="noopener" data-lia-auto-title="Agent Evaluation — Now Generally Available" data-lia-auto-title-active="0"&gt;Agent Evaluation — Now Generally Available&lt;/A&gt;&lt;/H4&gt;
&lt;P&gt;Testing agents manually, one conversation at a time, doesn’t scale. Agent Evaluation gives makers a built-in, no-code way to test and monitor agent quality, safety, and reliability at scale. Create evaluation sets using AI-generated queries, past test sessions, or your own QA pairs — then run them automatically to catch regressions before they reach users.&lt;/P&gt;
&lt;H4&gt;2. &lt;A class="lia-external-url" href="https://www.microsoft.com/en-us/microsoft-copilot/blog/copilot-studio/computer-using-agents-now-deliver-more-secure-ui-automation-at-scale/" target="_blank" rel="noopener"&gt;Computer-using agents — more secure UI automation at scale&lt;/A&gt;&lt;/H4&gt;
&lt;P&gt;Computer-using agents (CUA) can now automate tasks through user interfaces—clicking, typing, and navigating apps when an API isn’t available—while delivering a more secure approach for UI automation at scale (with stronger controls for admin governance and credential handling).&lt;/P&gt;
&lt;H4&gt;3.&lt;A class="lia-external-url" href="https://www.microsoft.com/en-us/microsoft-copilot/blog/copilot-studio/new-and-improved-multi-agent-orchestration-connected-experiences-and-faster-prompt-iteration/" target="_blank" rel="noopener"&gt; Multi-agent orchestration, connected experiences, and faster prompt iteration&lt;/A&gt;&lt;/H4&gt;
&lt;P&gt;One of the biggest recent updates is improved multi-agent orchestration, alongside new connected experiences and faster prompt iteration, so you can coordinate specialized agents more effectively and refine behavior faster as you move from prototype to production.&lt;/P&gt;
&lt;H3&gt;Resources to Bookmark&lt;/H3&gt;
&lt;DIV class="styles_lia-table-wrapper__h6Xo9 styles_table-responsive__MW0lN"&gt;&lt;table border="1" style="width: 863px; border-width: 1px;"&gt;&lt;thead&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;Resource&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;&lt;STRONG&gt;What It's For&lt;/STRONG&gt;&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/thead&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/" target="_blank" rel="noopener"&gt;Copilot Studio Documentation&lt;/A&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Official product docs, tutorials, and references&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/power-platform/release-plan/2026wave1/microsoft-copilot-studio/" target="_blank" rel="noopener"&gt;2026 Release Wave 1 Plan&lt;/A&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;What's shipping April–September 2026&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;
&lt;P&gt;&lt;A href="https://aka.ms/CSdiscussions" target="_blank" rel="noopener"&gt;Copilot Studio Discussion Space&lt;/A&gt;&lt;/P&gt;
&lt;/td&gt;&lt;td&gt;
&lt;P&gt;Ask questions, share ideas, connect with peers&lt;/P&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;colgroup&gt;&lt;col style="width: 50.00%" /&gt;&lt;col style="width: 50.00%" /&gt;&lt;/colgroup&gt;&lt;/table&gt;&lt;/DIV&gt;
&lt;H3&gt;Next steps&lt;/H3&gt;
&lt;P&gt;&lt;STRONG&gt;1.&lt;/STRONG&gt;&amp;nbsp;Hit &lt;STRONG&gt;Follow&lt;/STRONG&gt; at the top of the page and introduce yourself in the &lt;A class="lia-external-url" href="https://aka.ms/MCSdiscussions" target="_blank" rel="noopener"&gt;discussion forum&lt;/A&gt;&amp;nbsp; with what you’re building&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;2.&lt;/STRONG&gt;&amp;nbsp;New to Copilot Studio? &lt;A class="lia-external-url" href="https://aka.ms/TryCopilotStudio" target="_blank" rel="noopener"&gt;Sign up for the free trial&lt;/A&gt; and bookmark the resources below for docs, release plans, training, and governance guidance.&lt;/P&gt;
&lt;P&gt;We can’t wait to see what you create.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Apr 2026 18:16:55 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/hello-world-welcome-to-the-copilot-studio-blog/ba-p/4509681</guid>
      <dc:creator>David_Abu</dc:creator>
      <dc:date>2026-04-09T18:16:55Z</dc:date>
    </item>
    <item>
      <title>Agent Evaluation in Microsoft Copilot Studio is now generally available</title>
      <link>https://techcommunity.microsoft.com/t5/copilot-studio-blog/agent-evaluation-in-microsoft-copilot-studio-is-now-generally/ba-p/4507392</link>
      <description>&lt;P&gt;As agents move into production, evaluations help take each build from experimentation to a reliable system. And they help answer the question that matters most in production: Can we trust this agent to behave correctly, consistently, and safely — every time?&lt;/P&gt;
&lt;P&gt;Manual testing simply can't scale to answer that question. Spot-checking responses one-by-one is slow, inconsistent, and not designed for agents that handle hundreds or thousands of interactions. Agent Evaluation in &lt;A href="https://www.microsoft.com/en-us/microsoft-365-copilot/microsoft-copilot-studio/" target="_blank" rel="noopener"&gt;Microsoft Copilot Studio&lt;/A&gt; helps fill that gap.&lt;/P&gt;
&lt;P&gt;Today, we are giving every maker a better way to assess agent behavior at scale—before launch and over the agent's lifecycle. &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-agent-evaluation-intro" target="_blank" rel="noopener"&gt;Agent Evaluation&lt;/A&gt; is now generally available.&lt;/P&gt;
&lt;P&gt;Validate production readiness before launch and after every change&lt;/P&gt;
&lt;P&gt;Agent Evaluation is built directly into Copilot Studio—there’s no separate tool to install and no integrations to configure. Within the agent, the evaluation experience provides an end-to-end workflow for creating test cases, running evals, and reviewing results, all without writing a single line of code.&lt;/P&gt;
&lt;P&gt;Whether you're a maker validating readiness before publishing, a quality assurance (QA) team enforcing organizational standards, an agent owner preparing for rollout, or a compliance team that needs documented evidence of agent behavior, Agent Evaluation is designed to integrate into the workflows teams already use to ship and operate agents.&lt;/P&gt;
&lt;H2&gt;Designed to build trust at scale&lt;/H2&gt;
&lt;P&gt;Agent Evaluation is designed for organizations that carry real accountability for the agents they deploy. That means evals need to fit into existing workflows, help gather compliance documentation, and produce results that hold up to scrutiny.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Versioned and auditable results&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Every evaluation run produces a structured record, including the test set used, the user profile that ran it, the date and duration, and the results from each grader for every test case. These records are available in the evaluation history view, where teams can track performance over time and compare results across runs. For regulated industries and compliance-driven deployments, this record is the artifact that can help demonstrate that an agent was tested against defined behavioral standards before reaching users.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;Identity-based evaluation&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Each evaluation run is associated with a selected user profile. The agent is evaluated under that identity, using the same knowledge sources, tools, and connectors that the maker accesses in production. This helps ensure evaluation results reflect real-world behavior, rather than a simplified test environment.&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;STRONG&gt;API-based evaluation&lt;/STRONG&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;For teams that operate continuous integration and delivery pipelines, Agent Evaluation is available via API. Teams can retrieve test sets, trigger evaluation runs, and track results programmatically, integrating evals directly into existing deployment workflows to assess agent behavior proactively at scale.&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H2&gt;Running an evaluation: from test case to results&lt;/H2&gt;
&lt;P&gt;Agent Evaluation in Copilot Studio follows a guided workflow that helps makers move from setup to results without disrupting their workflow or leaving the product.&lt;/P&gt;
&lt;H3&gt;Step 1: Create a test set&lt;/H3&gt;
&lt;P&gt;Evaluation starts with &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-agent-evaluation-create" target="_blank" rel="noopener"&gt;creating a test set&lt;/A&gt;—a collection of questions or scenarios used to assess an agent’s behavior. Makers can build test sets in multiple ways: uploading a CSV with prepared questions and expected responses, writing targeted questions manually, or generating questions from production conversations based on common topics.&lt;/P&gt;
&lt;P&gt;To help teams save time configuring test questions, Copilot Studio even includes built-in AI generation options:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;The &lt;STRONG&gt;quick question set&lt;/STRONG&gt; generates 10 questions instantly based on the agent’s description, instructions, and capabilities, providing an initial signal with minimal preparation required.&lt;/LI&gt;
&lt;LI&gt;The &lt;STRONG&gt;full question set&lt;/STRONG&gt; generates up to 100 questions drawn from the agent’s knowledge sources or defined topics, helping teams build broader coverage grounded in the agent’s actual content.&lt;/LI&gt;
&lt;/UL&gt;
&lt;img /&gt;
&lt;H3&gt;Step 2: Configure evaluation methods&lt;/H3&gt;
&lt;P&gt;With test cases in place, makers can determine how evaluations measure agent responses by &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-agent-evaluation-overview" target="_blank" rel="noopener"&gt;selecting one or more test methods&lt;/A&gt;. Built-in methods cover a range of evaluation dimensions, including:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;General response quality&lt;/LI&gt;
&lt;LI&gt;Semantic meaning relative to an expected answer&lt;/LI&gt;
&lt;LI&gt;Keyword presence&lt;/LI&gt;
&lt;LI&gt;Text similarity&lt;/LI&gt;
&lt;LI&gt;Exact match&lt;/LI&gt;
&lt;LI&gt;Capability usage&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;However, for organizations that need to go beyond these dimensions, &lt;A href="https://www.microsoft.com/microsoft-copilot/blog/copilot-studio/custom-graders-in-copilot-studio-setting-high-standards-for-agent-evals/" target="_blank" rel="noopener"&gt;&lt;STRONG&gt;Custom Graders&lt;/STRONG&gt;&lt;/A&gt;&amp;nbsp; (available as a Classification method) allow makers to encode your organization’s policies, quality standards, or other rules directly into the evaluation.&lt;/P&gt;
&lt;P&gt;Keep in mind, multiple methods can be combined in a single test run, giving teams a layered view of agent performance.&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H3&gt;Step 3: Run the evaluation and review results&lt;/H3&gt;
&lt;P&gt;Once the test set and methods are configured, makers can &lt;A href="https://learn.microsoft.com/en-us/microsoft-copilot-studio/analytics-agent-evaluation-results" target="_blank" rel="noopener"&gt;run the evaluation&lt;/A&gt; directly from Copilot Studio. Results appear in a structured table, with each row representing a test case and each column representing an evaluation method.&lt;/P&gt;
&lt;P&gt;Pass and fail signals are visible immediately, and the &lt;STRONG&gt;Evaluation summary&lt;/STRONG&gt; panel shows aggregated scores across all methods for a given run. Selecting an individual test case opens a detailed view with the agent's full response, the result and explanation from each grader, the expected answer where you've provided one, and the knowledge sources the agent used to generate its response.&lt;/P&gt;
&lt;P&gt;Because a test set can be saved and reused, evaluation becomes a repeatable quality check across agent versions. When a prompt changes, a knowledge source is updated, or a new capability is added, the same test set then runs again—producing consistent, comparable signals that help teams validate changes before they reach end users.&lt;/P&gt;
&lt;img /&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;H2&gt;What's next for Agent Evaluation?&lt;/H2&gt;
&lt;P&gt;General availability establishes the foundation. From here, there are already plans to expand evaluation coverage to support multi-turn conversation, deeper automation, and more of the deployment lifecycle, so organizations can monitor agent reliability at scale.&lt;/P&gt;
&lt;P&gt;The goal is evaluation that travels with your agent from first build through ongoing production use. And you can start today. Open the Evaluation tab in Copilot Studio, choose a test method, and run your first evaluation in minutes. No code required.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://go.microsoft.com/fwlink/p/?linkid=2252408&amp;amp;clcid=0x409&amp;amp;culture=en-us&amp;amp;country=us" target="_blank" rel="noopener"&gt;Log in to Copilot Studio&lt;/A&gt; to start evaluating agents—or &lt;A href="https://learn.microsoft.com/en-us/power-platform/release-plan/2025wave1/microsoft-copilot-studio/planned-features" target="_blank" rel="noopener"&gt;explore the roadmap&lt;/A&gt; to see what's next for Agent Evaluation.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Mar 2026 19:13:36 GMT</pubDate>
      <guid>https://techcommunity.microsoft.com/t5/copilot-studio-blog/agent-evaluation-in-microsoft-copilot-studio-is-now-generally/ba-p/4507392</guid>
      <dc:creator>Efrat_Gilboa</dc:creator>
      <dc:date>2026-03-31T19:13:36Z</dc:date>
    </item>
  </channel>
</rss>

