{"id":1169231,"date":"2026-04-21T18:42:10","date_gmt":"2026-04-22T01:42:10","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&#038;p=1169231"},"modified":"2026-04-24T14:39:12","modified_gmt":"2026-04-24T21:39:12","slug":"evaluating-proactive-ai-mediators-in-multi-party-conversation-with-promediate","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/evaluating-proactive-ai-mediators-in-multi-party-conversation-with-promediate\/","title":{"rendered":"Evaluating Proactive AI Mediators in Multi-Party Conversation with\u00a0ProMediate\u00a0"},"content":{"rendered":"\n<p>By&nbsp;<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/liuziyi219.github.io\/\" target=\"_blank\" rel=\"noopener noreferrer\">Ziyi Liu<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,&nbsp;<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/basarraf\/?msockid=210d6c482974676b0f9f7ac128496684\" target=\"_blank\" rel=\"noreferrer noopener\">Bahar Sarrafzadeh,<\/a>&nbsp;<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/zhoupei\/?msockid=210d6c482974676b0f9f7ac128496684\" target=\"_blank\" rel=\"noreferrer noopener\">Pei Zhou,<\/a>&nbsp;<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.bing.com\/search?q=longqi%20yang&qs=n&form=QBRE&sp=-1&ghc=1&lq=0&pq=longqi%20yang&sc=11-11&sk=&cvid=E46D78483DA847D7A8660CD7995A1F4A\" target=\"_blank\" rel=\"noopener noreferrer\">Longqi Yang<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,&nbsp;<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/ashisharma\/?msockid=210d6c482974676b0f9f7ac128496684\" target=\"_blank\" rel=\"noreferrer noopener\">Ashish Sharma<\/a>&nbsp;&nbsp;<\/p>\n\n\n\n<p>Imagine you are in a high-stakes group discussion, stuck in a circular argument with no consensus in sight. 
Now, imagine an AI agent sitting at that table. Unlike traditional tools that wait for a prompt, this agent proactively intervenes at the perfect moment with a breakthrough suggestion.&nbsp;<strong>This scenario&nbsp;represents&nbsp;the emerging shift from passive AI assistants to proactive team collaborators.<\/strong>&nbsp;<\/p>\n\n\n\n<p>As LLMs evolve toward handling multi-party teamwork, they are increasingly expected to navigate complex group dynamics. However, current research often overlooks the nuance of these interactions. While agents are being designed for multi-party&nbsp;settings, we still lack the benchmarks to&nbsp;evaluate&nbsp;the&nbsp;<strong>real-time effectiveness<\/strong>&nbsp;of their interventions or their&nbsp;broader&nbsp;<strong>socio-cognitive intelligence&nbsp;<\/strong>when&nbsp;<strong>working with groups of people.<\/strong>&nbsp;<\/p>\n\n\n\n<p>In real-world dynamics\u2014like a high-stakes budget negotiation\u2014success is not just about the&nbsp;final outcome; it is about navigating hidden interests, managing &#8220;negotiation fatigue,&#8221; and knowing exactly when to speak up to break a deadlock. 
To address these gaps, we introduce&nbsp;<strong>ProMediate<\/strong>: a new framework from Microsoft&nbsp;Office of Applied&nbsp;Research designed to evaluate the next generation of proactive agents in multi-party negotiations.<\/p>\n\n\n\n<div style=\"height:10px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"612\" height=\"531\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image.png\" alt=\"graphical user interface, text, application, chat or text message\" class=\"wp-image-1169289\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image.png 612w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-300x260.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-207x180.png 207w\" sizes=\"auto, (max-width: 612px) 100vw, 612px\" \/><\/figure>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>ProMediate&nbsp;moves beyond static benchmarks by introducing a dynamic architecture that evaluates how agents handle the social nuances of human interaction. The framework consists of two integrated parts:<\/p>\n\n\n\n<ol start=\"1\" class=\"wp-block-list\">\n<li><strong>Real Case Scenarios:<\/strong>&nbsp;The framework&nbsp;utilizes&nbsp;a collection of high-stakes, multi-issue negotiation cases. These scenarios are built upon&nbsp;<strong>asymmetric information<\/strong>&nbsp;and&nbsp;<strong>conflicting interests<\/strong>, requiring participants to navigate a complex bargaining space.&nbsp;&nbsp;<\/li>\n<\/ol>\n\n\n\n<ol start=\"2\" class=\"wp-block-list\">\n<li><strong>Proactive Conversation Simulation:<\/strong>\u00a0ProMediate\u00a0features a simulation environment with a\u00a0<strong>plug-and-play proactive mediator<\/strong>\u00a0role. 
The environment uses LLM-based agents to mimic human negotiators, complete with distinct personas and mental state trajectories. The mediator monitors the dialogue and determines the optimal <strong>intervention tempo<\/strong>, deciding not only <em>what<\/em> to say, but exactly <em>when<\/em> to intervene to steer the group toward consensus.<\/li>\n<\/ol>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"how-does-our-evaluation-framework-capture-the-dynamic-of-group-consensus-change\">How does our evaluation framework capture the dynamics of group consensus change?<\/h2>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"936\" height=\"261\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1.png\" alt=\"Heatmap\" class=\"wp-image-1169290\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1.png 936w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1-300x84.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1-768x214.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1-240x67.png 240w\" sizes=\"auto, (max-width: 936px) 100vw, 936px\" \/><\/figure>\n\n\n\n<div style=\"height:10px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p>Instead of focusing only on the final outcome, the evaluation framework tracks <strong>dynamic group decision-making<\/strong>. It extracts each participant\u2019s <strong>attitude<\/strong> toward each topic and calculates an <strong>agreement score<\/strong> among all parties at each step. 
This provides a clear <strong>consensus change trend<\/strong> with richer signals: researchers can directly observe where consensus rises or falls, and whether the mediator\u2019s intervention improves it.<\/p>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"socio-cognitive-level-evaluation\">Socio-cognitive level evaluation<\/h2>\n\n\n\n<p>Besides consensus tracking, which is a conversation-level evaluation, we also evaluate the mediator\u2019s behavior in terms of socio-cognitive intelligence. Relying solely on consensus change might not reveal the mediator\u2019s full capability, since the mediator may try to facilitate the negotiation while the humans do not follow. We therefore evaluate the mediator\u2019s behavior along four dimensions: perceptual differences, negative emotions, cognitive challenges, and communication breakdown.<\/p>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"socially-intelligent-agent-vs-generic-agent\">Socially intelligent agent vs. generic agent<\/h2>\n\n\n\n<p>A <strong>generic mediator<\/strong> is essentially a general chat-room agent: it joins the conversation and responds, but without any specialized playbook. A <strong>socially intelligent mediator<\/strong>, on the other hand, is equipped with mediation-specific skills in thinking and strategic planning. We compare the two mediator agents across different difficulty settings.<\/p>\n\n\n\n<p>The difference shows up clearly in ProMediate&#8217;s hardest setting, where participants are actively competing. 
The socially intelligent mediator produced meaningfully larger gains in consensus than the generic baseline, helping the group close more ground toward agreement. It was also noticeably faster to step in, catching friction points before they hardened into deadlock. In short, it\u00a0didn&#8217;t\u00a0just talk better\u2014it acted sooner and hit the right moments. <strong>Social intelligence, it turns out, is what separates a chat-room bystander from a mediator that actually moves the needle.<\/strong>\u00a0<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"936\" height=\"351\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2.png\" alt=\"table showing results across all scenarios with GPT-4.1\" class=\"wp-image-1169291\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2.png 936w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2-300x113.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2-768x288.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2-240x90.png 240w\" sizes=\"auto, (max-width: 936px) 100vw, 936px\" \/><\/figure>\n\n\n\n<div style=\"height:10px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"looking-ahead\">Looking ahead<\/h2>\n\n\n\n<p>As LLMs continue to expand in capability, the advantages of proactive, socio-cognitive mediation are likely to compound. 
Our results indicate that while advanced reasoning models show promise, the ability to successfully navigate a multi-party negotiation depends on more than just raw scale; <strong>it requires a specialized interaction strategy that can decode the social &#8220;pulse&#8221; of a room.<\/strong><\/p>\n\n\n\n<p>For teams building intelligent features on collaborative platforms, from AI meeting assistants to automated conflict resolution tools, this work offers a clear message: <strong>the architecture of a mediator\u2019s proactivity matters as much as the model\u2019s size.<\/strong> By using the <strong>ProMediate<\/strong> metrics as high-signal reward labels, we can transition from simple prompting to sophisticated training. This allows us to develop agents that learn not just <em>what<\/em> to say, but critically, <em>when<\/em> to speak within the complex rhythm of multi-turn interactions.<\/p>\n\n\n\n<p>As the digital environments where we collaborate grow more complex, the need for principled, socially aware intervention will only become more vital. 
The core insight of our work is that effective mediation is not just about the final deal:\u00a0it is about the socio-cognitive intelligence\u00a0required\u00a0to guide the journey from conflict to consensus.<\/p>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p><strong>Read the full paper:&nbsp;<\/strong><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/arxiv.org\/abs\/2510.25224\" target=\"_blank\" rel=\"noopener noreferrer\">[2510.25224] ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>By&nbsp;Ziyi Liu (opens in new tab),&nbsp;Bahar Sarrafzadeh,&nbsp;Pei Zhou,&nbsp;Longqi Yang (opens in new tab),&nbsp;Ashish Sharma&nbsp;&nbsp; Imagine you are in a high-stakes group discussion, stuck in a circular argument with no consensus in sight. Now, imagine an AI agent sitting at that table. 
Unlike traditional tools that wait for a prompt, this agent proactively intervenes at the [&hellip;]<\/p>\n","protected":false},"author":43305,"featured_media":1169325,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":1160955,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[13556,13554],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-1169231","msr-blog-post","type-msr-blog-post","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-research-area-human-computer-interaction","msr-locale-en_us"],"msr_assoc_parent":{"id":1160955,"type":"group"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1169231","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/43305"}],"version-history":[{"count":11,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1169231\/revisions"}],"predecessor-version":[{"id":1169748,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1169231\/revisions\/1169748"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1169325"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1169231"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1169231"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.
com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1169231"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1169231"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}