New Future of Work 2026 | Jaime Teevan, Jenna Butler, Jake Hofman, Rebecca Janssen

Microsoft Research Blog

New Future of Work: AI is driving rapid change, uneven benefits

April 9, 2026 | Jaime Teevan, Sonia Jaffe, Rebecca Janssen, Nancy Baym, Siân Lindley, Bahar Sarrafzadeh, Brent Hecht, Jenna Butler, Jake Hofman, and Sean Rintel

For the past five years, the New Future of Work report has captured how work is changing. This year, the shift feels especially sharp. Previous editions have focused on technology’s role in increasing productivity by automating tasks, accelerating communication, and…

Microsoft Research Podcast

Ideas: Steering AI toward the work future we want

April 9, 2026 | Jaime Teevan, Jenna Butler, Jake Hofman, and Rebecca Janssen

Microsoft Chief Scientist Jaime Teevan and researchers Jenna Butler, Jake Hofman, and Rebecca Janssen unpack the New Future of Work Report 2025 and explore the ideal AI-driven working world. Plus, is AI a tool or a collaborator? And why the answer matters.

Articles

Memento: Teaching LLMs to Manage Their Own Context

April 8, 2026

Vasilis Kontonis, Yuchen Zeng, Shivam Garg, Lingjiao Chen, Hao Tang, Ziyan Wang, Ahmed Awadallah, Eric Horvitz, John Langford, Dimitris Papailiopoulos We taught models to compress their own chain-of-thought mid-generation. Peak KV cache drops 2–3x, throughput nearly doubles, and the…

Articles

BizGenEval：为商业视觉内容生成建立一把真正有用的“标尺”

April 7, 2026

近年来，图像生成模型的飞速发展令人瞩目。从早期的通用图像生成，到如今逐步迈向更具实用价值的视觉内容创作，这一领域正经历从“好看”到“好用”的关键跃迁。然而，在繁荣表象之下，一个核心挑战正日益凸显：现有主流评测基准仍以自然图像为主，缺乏面向商业设计场景的系统性评估，无法有效衡量模型在结构化和多重约束下的表现。与通用图像相比，商业视觉文档往往包含高密度文本、复杂版式结构以及多种视觉元素的协同布局，其...

Articles

MIRA：让AI真正读懂生命体征的”不规则律动”

April 7, 2026

编者按：医疗AI在影像诊断与病历理解领域已展现出惊人的潜力，但一个更根本的挑战正浮出水面：模型能够读懂静态的"快照"，却难以捕捉生命的"动态流转"。现有方法多依赖规则采样与插值预处理来强行对齐数据，面临着信息失真、噪声引入、临床适应性差等局限，更引发深层质疑——若模型始终需要人工规整才能理解时间，那它是否真正“领悟”了生理演变的内在规律？对此，微软亚洲研究院推出MIRA（Medical Irre...

Articles

通用语音识别模型VibeVoice ASR：长达60分钟音频一次性“直出”结构化转写

April 6, 2026

语音识别技术在近年来取得了飞速发展，但在面对真实的复杂场景时，许多系统依然显得有些力不从心：漫长的会议、多人混杂的交谈、极具专业性的行业术语，或是中英夹杂的日常对话…… 传统的语音识别系统往往需要将长音频切分成一个个短小的片段，分别识别后再费力地“拼凑”起来。这种“化整为零”的方法不仅容易丢失上下文，还常常在谁说了什么、什么时候说的等问题上张冠李戴。近日，微软亚洲研究院发布了一款通用的语音识别模...

Articles

AI Next 播客 | 对话李东胜：AI 与人脑，正在彼此“教会”对方什么？

April 6, 2026

《AI Next》是微软亚洲研究院推出的一档利用 AI 技术制作的播客，内容聚焦 AI 前沿技术、科研趋势与社会影响。第一季主要围绕当今智能发展的核心议题，探索前沿趋势。在第六期节目中，我们邀请到微软亚洲研究院首席研究员李东胜博士，一同探讨 AI 与大脑的深层关联。以人脑为灵感而诞生的神经网络，究竟与大脑是单纯的模仿与被模仿关系，还是存在更深度的联结？AI 该如何向历经亿万年进化的人脑学习高效的...

Articles

别让AI在部署后停滞：OEL重塑大模型进化之路

April 6, 2026

大模型部署之后能力还可以持续提升吗？如果去问AI开发者这个问题，在过去，答案大多是否定的。传统的大模型在出厂那一刻，其智力水平就几乎定型了。即便在实际应用中遇到了未见过的新问题，或者被用户反馈指出错误，模型也很难像人类一样，在下一次尝试时立刻吸取教训。这正是当前大语言模型面临的尴尬现状。在现有的主流范式下，AI模型性能的提升高度依赖于离线训练，要么是昂贵的人工标注，要么是预先构建的模拟环境。一...

ADeLe | Three white line icons, showing a circle with a checkmark, a search document, and a set of tools, on a blue‑to‑green gradient background.

Microsoft Research Blog

ADeLe: Predicting and explaining AI performance across tasks

April 1, 2026 | Lexin Zhou and Xing Xie

AI benchmarks report how large language models (LLMs) perform on specific tasks but provide little insight into their underlying capabilities that drive their performance. They do not explain failures or reliably predict outcomes on new tasks. To address this, Microsoft…