{"id":1169231,"date":"2026-04-21T18:42:10","date_gmt":"2026-04-22T01:42:10","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&#038;p=1169231"},"modified":"2026-04-21T19:36:34","modified_gmt":"2026-04-22T02:36:34","slug":"evaluating-proactive-ai-mediators-in-multi-party-conversation-with-promediate","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/evaluating-proactive-ai-mediators-in-multi-party-conversation-with-promediate\/","title":{"rendered":"Evaluating Proactive AI Mediators in Multi-Party Conversation with\u00a0ProMediate\u00a0"},"content":{"rendered":"\n<p>By&nbsp;<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/liuziyi219.github.io\/\" target=\"_blank\" rel=\"noopener noreferrer\">Ziyi Liu<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,&nbsp;<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/basarraf\/?msockid=210d6c482974676b0f9f7ac128496684\" target=\"_blank\" rel=\"noreferrer noopener\">Bahar Sarrafzadeh,<\/a>&nbsp;<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/zhoupei\/?msockid=210d6c482974676b0f9f7ac128496684\" target=\"_blank\" rel=\"noreferrer noopener\">Pei Zhou,<\/a>&nbsp;<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.bing.com\/search?q=longqi%20yang&qs=n&form=QBRE&sp=-1&ghc=1&lq=0&pq=longqi%20yang&sc=11-11&sk=&cvid=E46D78483DA847D7A8660CD7995A1F4A\" target=\"_blank\" rel=\"noopener noreferrer\">Longqi Yang<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,&nbsp;<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/ashisharma\/?msockid=210d6c482974676b0f9f7ac128496684\" target=\"_blank\" rel=\"noreferrer noopener\">Ashish Sharma<\/a>&nbsp;&nbsp;<\/p>\n\n\n\n<p>Imagine you are in a high-stakes group discussion, stuck in a circular argument with no consensus in sight. Now, imagine an AI agent sitting at that table. Unlike traditional tools that wait for a prompt, this agent proactively intervenes at the perfect moment with a breakthrough suggestion.&nbsp;<strong>This scenario&nbsp;represents&nbsp;the emerging shift from passive AI assistants to proactive team collaborators.<\/strong>&nbsp;<\/p>\n\n\n\n<p>As LLMs evolve toward handling multi-party teamwork, they are increasingly expected to navigate complex group dynamics. However, current research often overlooks the nuance of these interactions. While agents are being designed for multi-party&nbsp;settings, we still lack the benchmarks to&nbsp;evaluate&nbsp;the&nbsp;<strong>real-time effectiveness<\/strong>&nbsp;of their interventions or their&nbsp;broader&nbsp;<strong>socio-cognitive intelligence&nbsp;<\/strong>when&nbsp;<strong>working with groups of people.<\/strong>&nbsp;<\/p>\n\n\n\n<p>In real-world dynamics\u2014like a high-stakes budget negotiation\u2014success is not just about the&nbsp;final outcome; it is about navigating hidden interests, managing &#8220;negotiation fatigue,&#8221; and knowing exactly when to speak up to break a deadlock. To address these gaps, we introduce&nbsp;<strong>ProMediate<\/strong>: a new framework from Microsoft&nbsp;Office of Applied&nbsp;Research designed to evaluate the next generation of proactive agents in multi-party negotiations.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"612\" height=\"531\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image.png\" alt=\"graphical user interface, text, application, chat or text message\" class=\"wp-image-1169289\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image.png 612w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-300x260.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-207x180.png 207w\" sizes=\"auto, (max-width: 612px) 100vw, 612px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>ProMediate\u00a0moves beyond static benchmarks by introducing a dynamic architecture that evaluates how agents handle the social nuances of human interaction. The framework consists of two integrated parts:<\/p>\n\n\n\n<ol start=\"1\" class=\"wp-block-list\">\n<li><strong>Real Case Scenarios:<\/strong>&nbsp;The framework&nbsp;utilizes&nbsp;a collection of high-stakes, multi-issue negotiation cases. These scenarios are built upon&nbsp;<strong>asymmetric information<\/strong>&nbsp;and&nbsp;<strong>conflicting interests<\/strong>, requiring participants to navigate a complex bargaining space.&nbsp;&nbsp;<\/li>\n<\/ol>\n\n\n\n<ol start=\"2\" class=\"wp-block-list\">\n<li><strong>Proactive Conversation Simulation:<\/strong>&nbsp;ProMediate&nbsp;features a simulation environment with a&nbsp;<strong>plug-and-play proactive mediator<\/strong>&nbsp;role. The environment uses LLM-based agents to mimic human negotiators, complete with distinct personas and mental state trajectories.&nbsp;The mediator&nbsp;monitors&nbsp;the dialogue and&nbsp;determines&nbsp;the&nbsp;optimal&nbsp;<strong>intervention tempo<\/strong>\u2014deciding not only&nbsp;<em>what<\/em>&nbsp;to say, but exactly&nbsp;<em>when<\/em>&nbsp;to intervene to steer the group toward consensus.&nbsp;<\/li>\n<\/ol>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"how-does-our-evaluation-framework-capture-the-dynamic-of-group-consensus-change\"><strong>How&nbsp;does&nbsp;our&nbsp;evaluation framework&nbsp;capture&nbsp;the dynamic of group consensus change?<\/strong>&nbsp;<\/h2>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"936\" height=\"261\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1.png\" alt=\"Heatmap\" class=\"wp-image-1169290\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1.png 936w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1-300x84.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1-768x214.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-1-240x67.png 240w\" sizes=\"auto, (max-width: 936px) 100vw, 936px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>Instead of focusing only on the&nbsp;final outcome,&nbsp;it&nbsp;&nbsp;tracks&nbsp;<strong>dynamic group decision-making<\/strong>.&nbsp;It&nbsp;extracts&nbsp;<strong>attitudes<\/strong>&nbsp;for each person on each topic and calculates the&nbsp;<strong>agreement score<\/strong>&nbsp;among all parties at each step. This provides a clear&nbsp;<strong>consensus change trend<\/strong>&nbsp;with enriched signals, as&nbsp;researchers&nbsp;can&nbsp;directly&nbsp;observe&nbsp;where the consensus&nbsp;is&nbsp;going&nbsp;up or down, and if the mediator\u2019s intervention improves the consensus or not.&nbsp;&nbsp;<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"socio-cognitive-level-evaluation\"><strong>Socio-cognitive level evaluation<\/strong>&nbsp;<\/h2>\n\n\n\n<p>Besides consensus&nbsp;tracking&nbsp;which focuses on the conversation-level evaluation, we also evaluate&nbsp;the&nbsp;mediator\u2019s&nbsp;behavior using socio-cognitive intelligence. Single reliance on the consensus change might not reveal the full capability of the mediator, as it is possible that the mediator&nbsp;tries&nbsp;to&nbsp;facilitate&nbsp;the negotiation but&nbsp;that&nbsp;humans&nbsp;won\u2019t&nbsp;follow.&nbsp;So&nbsp;we evaluate&nbsp;the&nbsp;mediator\u2019s behavior&nbsp;by&nbsp;only focusing on those 4 dimensions:&nbsp;perceptual differences, negative emotions, cognitive&nbsp;challenges&nbsp;and&nbsp;communication&nbsp;breakdown.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"socially-intelligent-agent-vs-generic-agent\"><strong>Socially Intelligent Agent VS Generic Agent<\/strong>&nbsp;<\/h2>\n\n\n\n<p>A&nbsp;<strong>generic mediator<\/strong>&nbsp;is&nbsp;essentially a&nbsp;general chat-room agent \u2014 it joins the conversation and responds, but without any specialized playbook. A&nbsp;<strong>socially intelligent mediator<\/strong>, on the other hand, is equipped with mediation-specific skills in thinking and strategic planning. We compare two mediator agents in different&nbsp;difficulty&nbsp;settings.&nbsp;&nbsp;<\/p>\n\n\n\n<p>The difference&nbsp;shows up&nbsp;clearly in&nbsp;ProMediate&#8217;s&nbsp;hardest setting, where participants are actively competing. The socially intelligent mediator produced meaningfully larger gains in consensus than the generic baseline, helping the group close more ground toward agreement. It was also noticeably faster to step in, catching friction points before they hardened into deadlock. In short, it&nbsp;didn&#8217;t&nbsp;just talk better \u2014 it acted sooner and hit the right moments.&nbsp;<strong>Social intelligence, it turns out, is what separates a chat-room bystander from a mediator that actually moves the needle.<\/strong>&nbsp;<\/p>\n\n\n\n<p><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"936\" height=\"351\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2.png\" alt=\"table\" class=\"wp-image-1169291\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2.png 936w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2-300x113.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2-768x288.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2026\/04\/image-2-240x90.png 240w\" sizes=\"auto, (max-width: 936px) 100vw, 936px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"looking-ahead\"><strong>Looking ahead<\/strong>&nbsp;<\/h2>\n\n\n\n<p>As LLMs continue to expand in capability, the advantages of proactive, socio-cognitive mediation are likely to compound. Our results&nbsp;indicate&nbsp;that while advanced reasoning models show promise, the ability to successfully navigate a multi-party negotiation depends on more than just raw scale;&nbsp;<strong>it requires a specialized interaction strategy that can decode the social &#8220;pulse&#8221; of a room.<\/strong>&nbsp;<\/p>\n\n\n\n<p>For teams building intelligent features on collaborative platforms\u2014from AI meeting assistants to automated conflict resolution tools\u2014this work offers a clear message:&nbsp;<strong>the architecture of a mediator\u2019s proactivity matters as much as the model\u2019s size.&nbsp;<\/strong>By using the&nbsp;<strong>ProMediate<\/strong>&nbsp;metrics as high-signal reward labels, we can transition from simple prompting to sophisticated training. This allows us to develop agents that learn not just&nbsp;<em>what<\/em>&nbsp;to speak, but critically,&nbsp;<em>when<\/em>&nbsp;to speak within the complex rhythm of multi-turn interactions.&nbsp;<\/p>\n\n\n\n<p>As the digital environments where we collaborate grow more complex, the need for principled, socially aware intervention will only become more vital. The core insight of our work is that effective mediation is not just about the final deal:&nbsp;it is about the socio-cognitive intelligence&nbsp;required&nbsp;to guide the journey from conflict to consensus.&nbsp;<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>Read the full paper:&nbsp;<\/strong><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/arxiv.org\/abs\/2510.25224\" target=\"_blank\" rel=\"noopener noreferrer\">[2510.25224] ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>By&nbsp;Ziyi Liu (opens in new tab),&nbsp;Bahar Sarrafzadeh,&nbsp;Pei Zhou,&nbsp;Longqi Yang (opens in new tab),&nbsp;Ashish Sharma&nbsp;&nbsp; Imagine you are in a high-stakes group discussion, stuck in a circular argument with no consensus in sight. Now, imagine an AI agent sitting at that table. Unlike traditional tools that wait for a prompt, this agent proactively intervenes at the [&hellip;]<\/p>\n","protected":false},"author":43305,"featured_media":1169325,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":0,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[13556,13554],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-1169231","msr-blog-post","type-msr-blog-post","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-research-area-human-computer-interaction","msr-locale-en_us"],"msr_assoc_parent":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1169231","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/43305"}],"version-history":[{"count":6,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1169231\/revisions"}],"predecessor-version":[{"id":1169341,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1169231\/revisions\/1169341"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1169325"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1169231"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1169231"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1169231"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1169231"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}