{"id":6205,"date":"2025-10-27T14:00:00","date_gmt":"2025-10-27T21:00:00","guid":{"rendered":""},"modified":"2026-03-03T14:51:04","modified_gmt":"2026-03-03T22:51:04","slug":"build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio","status":"publish","type":"copilot","link":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/","title":{"rendered":"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio"},"content":{"rendered":"\n<p>\n  As AI agents take on critical roles in business processes, the need for reliable, repeatable testing becomes essential. In the past, agents have been manually tested\u2014typing in questions, hoping for the right answers, and troubleshooting inconsistencies case by case. That time consuming, unscalable, and inconsistent approach that relies on intuition instead of structured testing doesn\u2019t work for enterprise-grade agent deployment. Enterprise makers need testing that is built-in, automated, and at-scale to deploy agents.&nbsp;\n<\/p>\n\n\n\n<p>Today, we are announcing the public preview of <a href=\"https:\/\/learn.microsoft.com\/en-us\/microsoft-copilot-studio\/analytics-agent-evaluation-create\"><strong>Agent Evaluation<\/strong> in Microsoft Copilot Studio<\/a>, bringing rigor directly into the agent-building tool you already use, backed by Microsoft\u2019s end-to-end approach.<\/p>\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69e3c8ff1f944&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image size-large wp-lightbox-container\"><img decoding=\"async\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture1-1024x576.webp\" alt=\"A helpdesk agent in Copilot Studio showing 32 test cases and an evaluation summary of 94%.\" class=\"wp-image-6413 webp-format\" srcset=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture1-1024x576.webp 1024w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture1-300x169.webp 300w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture1-768x432.webp 768w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture1.webp 1430w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" data-orig-src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture1-1024x576.webp\"><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/learn.microsoft.com\/en-us\/microsoft-copilot-studio\/analytics-agent-evaluation-create\" target=\"_blank\">Learn how to evaluate the performance of your agents<\/a><\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"introducing-agent-evaluation\">Introducing Agent Evaluation<\/h2>\n\n\n\n<p>Agent Evaluation enables structured, automated testing directly in <a href=\"https:\/\/www.microsoft.com\/en-us\/microsoft-365-copilot\/microsoft-copilot-studio\" target=\"_blank\" rel=\"noreferrer noopener\">Copilot Studio<\/a>, providing makers with a direct and seamless way to create evaluation sets, choose test methods, define success measures for the agent, and then run the test\u2014maximizing the power of model choice that Copilot Studio offers by evaluating agent performance across multiple agent-level models.&nbsp;&nbsp;<\/p>\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69e3c8ff2193f&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image size-large wp-lightbox-container\"><img decoding=\"async\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture2-1024x581.webp\" alt=\"A Helpdesk agent in Copilot Studio, asking the user to start by importing a file\" class=\"wp-image-6417 webp-format\" srcset=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture2-1024x581.webp 1024w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture2-300x170.webp 300w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture2-768x436.webp 768w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture2.webp 1430w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" data-orig-src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture2-1024x581.webp\"><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/www.microsoft.com\/en-us\/microsoft-365-copilot\/microsoft-copilot-studio\" target=\"_blank\" rel=\"noreferrer noopener\">Start building in Copilot Studio<\/a><\/div>\n<\/div>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"create-evaluation-sets\">Create evaluation sets<\/h3>\n\n\n\n<p>Makers can now upload predefined test sets, reuse recent Test Pane interactions, and add test questions manually. We are also enabling AI-powered generation of test queries from the agent\u2019s metadata, knowledge sources, and more\u2014delivering makers with quick visibility into agent quality without requiring the manual work for expected answers. This allows for early testing, while additional Q&amp;A sets can be manually added for deeper evaluation.&nbsp; <\/p>\n\n\n\n<aside class=\"wp-block-msx-kicker-container\">\n\t<div class=\"wp-block-msx-kicker wp-block-msx-kicker--align-left\" data-bi-an=\"Kicker Left\">\n\t\t<p class=\"wp-block-msx-kicker__title\">build enterprise-ready agents<\/p>\n\t\t<a\n\t\t\tclass=\"wp-block-msx-kicker__cta btn btn-link\"\n\t\t\thref=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/new-resources-and-guidance-to-plan-build-and-operate-enterprise-ready-agents\/\"\n\t\t\ttarget=\"_blank\"\t\t>\n\t\t\t<span>Get the guidance<\/span> <span class=\"glyph-append glyph-append-xsmall wp-block-msx-kicker__glyph glyph-append-chevron-right\"><\/span>\n\t\t<\/a>\n\t<\/div>\n<\/aside>\n\n\n\n<p>Makers can also mix AI-generated queries with manual or imported test sets to expand coverage, helping to evaluate both breadth (common scenarios auto-generated by AI) and depth (organization-specific queries) of agent behavior.<\/p>\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69e3c8ff22a96&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image size-large wp-lightbox-container\"><img decoding=\"async\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture3-1024x713.webp\" alt=\"A helpdesk agent in Copilot Studio where the user is configuring test sets\" class=\"wp-image-6418 webp-format\" srcset=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture3-1024x713.webp 1024w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture3-300x209.webp 300w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture3-768x535.webp 768w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture3.webp 1430w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" data-orig-src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture3-1024x713.webp\"><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"choose-flexible-test-methods\">Choose flexible test methods<\/h3>\n\n\n\n<p>Makers can choose from a wide rage of test methods\u2014whether it is exact or partial matches, advanced similarity metrics, intent recognition, or relevance and completeness, makers can choose the test methods that work for them based on the type of agent they are deploying. This allows makers to mimic how different users judge the agent\u2014from strict checklist compliance to overall helpfulness\u2014giving a comprehensive view of performance.<\/p>\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69e3c8ff23c22&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image size-large wp-lightbox-container\"><img decoding=\"async\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture4-1024x694.webp\" alt=\"A helpdesk agent in Copilot Studio where the user is reviewing test cases\" class=\"wp-image-6419 webp-format\" srcset=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture4-1024x694.webp 1024w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture4-300x203.webp 300w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture4-768x520.webp 768w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture4.webp 1430w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" data-orig-src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture4-1024x694.webp\"><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/learn.microsoft.com\/en-us\/microsoft-copilot-studio\/analytics-agent-evaluation-overview\" target=\"_blank\" rel=\"noreferrer noopener\">Learn how to choose evaluation methods<\/a><\/div>\n<\/div>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"define-measures-of-agent-success\">Define measures of agent success&nbsp;&nbsp;<\/h3>\n\n\n\n<p>Agent Evaluations allows you to define what constitutes success for your business, whether it is strict keyword matches (lexical alignment) or conceptual, meaning-based matches (semantic alignment).&nbsp;You can also set custom thresholds to ensure your agent meets your organization\u2019s unique standards for accuracy and relevance.<\/p>\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69e3c8ff24c71&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image size-large wp-lightbox-container\"><img decoding=\"async\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture5-1024x622.webp\" alt=\"A helpdesk agent in Copilot Studio where the user is editing a test case to define the passing score\" class=\"wp-image-6420 webp-format\" srcset=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture5-1024x622.webp 1024w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture5-300x182.webp 300w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture5-768x467.webp 768w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture5.webp 1430w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" data-orig-src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture5-1024x622.webp\"><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"execute-evaluations\">Execute evaluations<\/h3>\n\n\n\n<p>Once the dataset is prepared, test methods are chosen, and thresholds are configured, evaluations are executed with a single click. Results are displayed with clear pass or fail indicators, numeric scores on answer quality, and details around the knowledge sources used by the agent. No more guessing as to why an answer failed.<\/p>\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69e3c8ff25cc5&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image size-large wp-lightbox-container\"><img decoding=\"async\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture6-1024x695.webp\" alt=\"A helpdesk agent in Copilot Studio where the user is reviewing the results of a test case\" class=\"wp-image-6421 webp-format\" srcset=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture6-1024x695.webp 1024w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture6-300x203.webp 300w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture6-768x521.webp 768w, https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture6.webp 1430w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" data-orig-src=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/Picture6-1024x695.webp\"><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/learn.microsoft.com\/en-us\/microsoft-copilot-studio\/analytics-agent-evaluation-results\" target=\"_blank\" rel=\"noreferrer noopener\">Learn how to run agent evaluations<\/a><\/div>\n<\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"transforming-agent-quality-from-build-to-continuous-improvement\">\n  Transforming agent quality: From build to continuous improvement&nbsp;\n<\/h2>\n\n\n\n<aside class=\"wp-block-msx-kicker-container\">\n\t<div class=\"wp-block-msx-kicker wp-block-msx-kicker--align-left\" data-bi-an=\"Kicker Left\">\n\t\t<p class=\"wp-block-msx-kicker__title\">6 trends in agent adoption<\/p>\n\t\t<a\n\t\t\tclass=\"wp-block-msx-kicker__cta btn btn-link\"\n\t\t\thref=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/6-core-capabilities-to-scale-agent-adoption-in-2026\/\"\n\t\t\ttarget=\"_blank\"\t\t>\n\t\t\t<span>Read the blog<\/span> <span class=\"glyph-append glyph-append-xsmall wp-block-msx-kicker__glyph glyph-append-chevron-right\"><\/span>\n\t\t<\/a>\n\t<\/div>\n<\/aside>\n\n\n\n<p>Agent Evaluation transforms agent development into a full lifecycle of build, test, and improve. We want makers to have the same rigorous and streamlined quality process for agents as they do for traditional software. By launching evaluations in Copilot Studio, we\u2019re ensuring that every agent can be tested and continuously improved, leading to well-tested agents deployed across the organization.&nbsp;This also enables makers to test agents using different agent-level models for agent orchestration, to find the model that best suits the business process being transformed. You can go from building an agent to testing it in the same interface, all while being confident in Microsoft enterprise-grade permission controls, compliance, and governance capabilities.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"next-steps\">\n  Next steps&nbsp;\n<\/h2>\n\n\n\n<p>To learn how to get started, visit <a href=\"https:\/\/learn.microsoft.com\/en-us\/microsoft-copilot-studio\/analytics-agent-evaluation-create\">Agent Evaluation in Copilot Studio<\/a>.&nbsp;<\/p>\n\n\n\n<p>Check out all the updates live as we ship them,&nbsp;as well as new features released in the next few months here:&nbsp;<a href=\"https:\/\/learn.microsoft.com\/en-us\/microsoft-copilot-studio\/whats-new\" target=\"_blank\" rel=\"noreferrer noopener\">What\u2019s new in Microsoft Copilot Studio<\/a>.&nbsp;<\/p>\n\n\n\n<p>To learn more about Copilot Studio and how it can transform your organization\u2019s productivity,&nbsp;<a href=\"https:\/\/aka.ms\/CopilotStudio\" target=\"_blank\" rel=\"noreferrer noopener\">visit the Copilot Studio website<\/a>&nbsp;or&nbsp;<a href=\"https:\/\/aka.ms\/TryCopilotStudio\" target=\"_blank\" rel=\"noreferrer noopener\">sign up for our free trial today<\/a>.<\/p>\n\n\n\n<p>We look forward to sharing more about Agent Evaluation at the <a href=\"https:\/\/powerplatformconf.com\/#!\/\" target=\"_blank\" rel=\"noreferrer noopener\">Power Platform Community Conference 2025<\/a>.<\/p>\n\n\n\n<div class=\"wp-block-buttons is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-a89b3969 wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/www.microsoft.com\/en-us\/microsoft-365-copilot\/microsoft-copilot-studio\" target=\"_blank\" rel=\"noreferrer noopener\">Build and customize an agent that works for you today<\/a><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Automated agent testing is now built into Copilot Studio\u2014evaluate performance, improve quality, and scale confidently with Agent Evaluation.<\/p>\n","protected":false},"author":89,"featured_media":6247,"template":"","cs-content-type":[937,934,933],"cs-topic":[999,939,940],"coauthors":[1007],"class_list":["post-6205","copilot","type-copilot","status-publish","has-post-thumbnail","hentry","cs-content-type-feature-releases","cs-content-type-news","cs-content-type-tips-and-guides","cs-topic-agent-adoption","cs-topic-agent-governance","cs-topic-agentic-ai","review-flag-1714037975-198","review-flag-micro-1714037981-307","review-flag-new-1714037972-526","review-flag-publi-1714037979-1000"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.2 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio | Microsoft Copilot Blog<\/title>\n<meta name=\"description\" content=\"Explore Agent Evaluation in Microsoft Copilot Studio\u2014automated testing, flexible methods, and scalable performance insights for enterprise-grade AI agents.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio | Microsoft Copilot Blog\" \/>\n<meta property=\"og:description\" content=\"Explore Agent Evaluation in Microsoft Copilot Studio\u2014automated testing, flexible methods, and scalable performance insights for enterprise-grade AI agents.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\" \/>\n<meta property=\"og:site_name\" content=\"Microsoft Copilot Blog\" \/>\n<meta property=\"article:modified_time\" content=\"2026-03-03T22:51:04+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1260\" \/>\n\t<meta property=\"og:image:height\" content=\"840\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:image\" content=\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"5 minutes\" \/>\n\t<meta name=\"twitter:label2\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data2\" content=\"Efrat Gilboa\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\"},\"author\":[{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/author\/efrat-gilboa\/\",\"@type\":\"Person\",\"@name\":\"Efrat Gilboa\"}],\"headline\":\"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio\",\"datePublished\":\"2025-10-27T21:00:00+00:00\",\"dateModified\":\"2026-03-03T22:51:04+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\"},\"wordCount\":715,\"publisher\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg\",\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\",\"url\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\",\"name\":\"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio | Microsoft Copilot Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg\",\"datePublished\":\"2025-10-27T21:00:00+00:00\",\"dateModified\":\"2026-03-03T22:51:04+00:00\",\"description\":\"Explore Agent Evaluation in Microsoft Copilot Studio\u2014automated testing, flexible methods, and scalable performance insights for enterprise-grade AI agents.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage\",\"url\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg\",\"contentUrl\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg\",\"width\":1260,\"height\":840,\"caption\":\"Photograph of two female executives in an office. One executive is seated, holding a Surface Laptop 13\\\" Platinum device in her right hand. A Surface Pro 12\\\" Platinum device in laptop mode is on another desk, connected to an external monitor. A Surface Arc Mouse is next to the device.\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Copilot Studio\",\"item\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#website\",\"url\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/\",\"name\":\"Microsoft Copilot Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#organization\",\"name\":\"Microsoft Copilot Blog\",\"url\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2024\/05\/cropped-microsoft_logo_element.webp\",\"contentUrl\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2024\/05\/cropped-microsoft_logo_element.webp\",\"width\":512,\"height\":512,\"caption\":\"Microsoft Copilot Blog\"},\"image\":{\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#\/schema\/person\/af3c935c28fac735b9027b9a7123c803\",\"name\":\"Kristin Gallagher\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=microsoft&r=ged0626dc21ff91c840c5284a64de46ca\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=microsoft&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=microsoft&r=g\",\"caption\":\"Kristin Gallagher\"},\"url\":\"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/author\/kristingallaghercs\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio | Microsoft Copilot Blog","description":"Explore Agent Evaluation in Microsoft Copilot Studio\u2014automated testing, flexible methods, and scalable performance insights for enterprise-grade AI agents.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/","og_locale":"en_US","og_type":"article","og_title":"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio | Microsoft Copilot Blog","og_description":"Explore Agent Evaluation in Microsoft Copilot Studio\u2014automated testing, flexible methods, and scalable performance insights for enterprise-grade AI agents.","og_url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/","og_site_name":"Microsoft Copilot Blog","article_modified_time":"2026-03-03T22:51:04+00:00","og_image":[{"width":1260,"height":840,"url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_image":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg","twitter_misc":{"Est. reading time":"5 minutes","Written by":"Efrat Gilboa"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#article","isPartOf":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/"},"author":[{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/author\/efrat-gilboa\/","@type":"Person","@name":"Efrat Gilboa"}],"headline":"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio","datePublished":"2025-10-27T21:00:00+00:00","dateModified":"2026-03-03T22:51:04+00:00","mainEntityOfPage":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/"},"wordCount":715,"publisher":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#organization"},"image":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage"},"thumbnailUrl":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg","inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/","url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/","name":"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio | Microsoft Copilot Blog","isPartOf":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage"},"image":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage"},"thumbnailUrl":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg","datePublished":"2025-10-27T21:00:00+00:00","dateModified":"2026-03-03T22:51:04+00:00","description":"Explore Agent Evaluation in Microsoft Copilot Studio\u2014automated testing, flexible methods, and scalable performance insights for enterprise-grade AI agents.","breadcrumb":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#primaryimage","url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg","contentUrl":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2025\/10\/SUR25-COMMR-LaptopPro-Snapdragon-Platinum-Office-Collab-01.jpg","width":1260,"height":840,"caption":"Photograph of two female executives in an office. One executive is seated, holding a Surface Laptop 13\" Platinum device in her right hand. A Surface Pro 12\" Platinum device in laptop mode is on another desk, connected to an external monitor. A Surface Arc Mouse is next to the device."},{"@type":"BreadcrumbList","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/build-smarter-test-smarter-agent-evaluation-in-microsoft-copilot-studio\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/"},{"@type":"ListItem","position":2,"name":"Copilot Studio","item":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/copilot-studio\/"},{"@type":"ListItem","position":3,"name":"Build smarter, test smarter: Agent Evaluation in Microsoft Copilot Studio"}]},{"@type":"WebSite","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#website","url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/","name":"Microsoft Copilot Blog","description":"","publisher":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#organization","name":"Microsoft Copilot Blog","url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2024\/05\/cropped-microsoft_logo_element.webp","contentUrl":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-content\/uploads\/2024\/05\/cropped-microsoft_logo_element.webp","width":512,"height":512,"caption":"Microsoft Copilot Blog"},"image":{"@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/#\/schema\/person\/af3c935c28fac735b9027b9a7123c803","name":"Kristin Gallagher","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=microsoft&r=ged0626dc21ff91c840c5284a64de46ca","url":"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=microsoft&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/295fa37b6bb2bbf59603c38b6ac7a7b4b86cd0f736387182fa9d0117f52cdf5e?s=96&d=microsoft&r=g","caption":"Kristin Gallagher"},"url":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/author\/kristingallaghercs\/"}]}},"msxcm_display_generated_audio":false,"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/copilot\/6205","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/copilot"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/types\/copilot"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/users\/89"}],"version-history":[{"count":23,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/copilot\/6205\/revisions"}],"predecessor-version":[{"id":7275,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/copilot\/6205\/revisions\/7275"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/media\/6247"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/media?parent=6205"}],"wp:term":[{"taxonomy":"cs-content-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/cs-content-type?post=6205"},{"taxonomy":"cs-topic","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/cs-topic?post=6205"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/microsoft-copilot\/blog\/wp-json\/wp\/v2\/coauthors?post=6205"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}