Microsoft Research Blog

AutoAdapt: Automated domain adaptation for large language models

April 22, 2026 | Sidharth Sinha, Anson Bastos, Xuchao Zhang, Akshay Nambi, Rujia Wang, and Chetan Bansal

Deploying large language models (LLMs) in real-world, high-stakes settings is harder than it should be. In domains like law, medicine, and cloud incident response, performance and reliability can quickly break down because adapting models to domain-specific requirements is a…

Articles

Evaluating Proactive AI Mediators in Multi-Party Conversation with ProMediate

April 21, 2026

By Ziyi Liu, Bahar Sarrafzadeh, Pei Zhou, Longqi Yang, and Ashish Sharma

Imagine you are in a high-stakes group discussion, stuck in a circular argument with no consensus in sight. Now, imagine an AI agent sitting at that…

Articles

The Art of Building Verifiers for Computer Use Agents

April 21, 2026

By Corby Rosset, Pratyusha Sharma, Andrew Zhao, Miguel Gonzalez-Fernandez, and Ahmed Awadallah

We share lessons learned from building a best-in-class verifier for computer use agent trajectories on the web, called the Universal Verifier. False positive rates drop to near zero (vs.…

Microsoft Research Podcast

Can we AI our way to a more sustainable world?

April 20, 2026 | Doug Burger, Amy Luers, and Ishai Menache

Doug Burger, sustainability expert Amy Luers, and optimization researcher Ishai Menache examine the global emissions implications of datacenter operations, efficiency gains, and AI's potential across electrification, materials, and food systems.

Microsoft receives 2026 Franz Edelman Award for Achievement in Advanced Analytics, Operations Research and Management Science

April 13, 2026

Microsoft transformed cloud fulfillment with the Intelligent Fulfillment Service (IFS), which integrates machine learning, optimization, and generative AI.

Microsoft Research Blog

New Future of Work: AI is driving rapid change, uneven benefits

April 9, 2026 | Jaime Teevan, Sonia Jaffe, Rebecca Janssen, Nancy Baym, Siân Lindley, Bahar Sarrafzadeh, Brent Hecht, Jenna Butler, Jake Hofman, and Sean Rintel

For the past five years, the New Future of Work report has captured how work is changing. This year, the shift feels especially sharp. Previous editions have focused on technology’s role in increasing productivity by automating tasks, accelerating communication, and…

Microsoft Research Podcast

Ideas: Steering AI toward the work future we want

April 9, 2026 | Jaime Teevan, Jenna Butler, Jake Hofman, and Rebecca Janssen

Microsoft Chief Scientist Jaime Teevan and researchers Jenna Butler, Jake Hofman, and Rebecca Janssen unpack the New Future of Work Report 2025 and explore the ideal AI-driven working world. Plus, is AI a tool or a collaborator? And why the answer matters.

Articles

Memento: Teaching LLMs to Manage Their Own Context

April 8, 2026

By Vasilis Kontonis, Yuchen Zeng, Shivam Garg, Lingjiao Chen, Hao Tang, Ziyan Wang, Ahmed Awadallah, Eric Horvitz, John Langford, and Dimitris Papailiopoulos

We taught models to compress their own chain-of-thought mid-generation. Peak KV cache drops 2–3x, throughput nearly doubles, and the…

Articles

BizGenEval: Building a truly useful "yardstick" for commercial visual content generation

April 7, 2026

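In recent years, image generation models have advanced at a remarkable pace. From early general-purpose image generation to today's steady move toward more practically valuable visual content creation, the field is making a key leap from "good-looking" to "useful." Beneath this apparent boom, however, a core challenge is becoming increasingly clear: mainstream evaluation benchmarks still center on natural images and lack systematic assessment for commercial design scenarios, so they cannot effectively measure how models perform under structured, multi-constraint conditions. Compared with general images, commercial visual documents often contain dense text, complex layout structures, and the coordinated arrangement of multiple visual elements, and their...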