Microsoft Research Blog

Rethinking imitation learning with Predictive Inverse Dynamics Models 

February 5, 2026 | Pallavi Choudhury, Lukas Schäfer, Chris Lovett, Katja Hofmann, and Sergio Valcarcel Macua

This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of what happens next, PIDMs reduce ambiguity and learn from far fewer demonstrations.

Microsoft Research Blog

Paza: Introducing automatic speech recognition benchmarks and models for low resource languages 

February 4, 2026 | Mercy Muchai, Kevin Chege, Nick Mumero, and Stephanie Nyairo

Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languages and 52 models and is tested with communities in real settings.

In the news | Unlocked

Giving every language a voice 

February 4, 2026

Recognizing what’s at stake, UNESCO designated 2022–2032 as the Decade of Indigenous Languages. This highlights a global effort to support revitalization and digital inclusion, and the work that partners in places like Nunavut are helping advance. According to the UNESCO…

In the news | LinkedIn Article

When Minutes Matter: Advancing Wildfire Early Detection with ALERTCalifornia 

February 3, 2026

Strengthening wildfire response takes more than any single institution, any single technology, or any single moment of heroism. It takes sustained collaboration between the people building new tools and the first responders relying on them under the harshest conditions.

Microsoft Research Blog

UniRG: Scaling medical imaging report generation with multimodal reinforcement learning 

January 27, 2026 | Sheng Zhang, Flora Liu, Guanghui Qin, Mu Wei, and Hoifung Poon

AI can help generate medical image reports, but today’s models struggle with varying reporting schemes. Learn how UniRG uses reinforcement learning to boost performance of medical vision-language models.

In the news | Microsoft Unlocked

Stewards of their environment 

January 24, 2026

In the arid expanse of northwestern Kenya, the Kakuma refugee camp has grown into a sprawling community of more than 300,000 displaced individuals from over 20 countries. Originally established in 1992 to shelter young people fleeing the war in Sudan,…

Articles

From "solid clay sculpture" to "high-precision asset": TRELLIS.2 rewrites the rules of 3D generation 

January 22, 2026

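In the AI era, text-to-image models produce blockbuster-quality pictures in seconds and text-to-video can replicate Hollywood-grade visual effects, yet 3D generation remains stuck at an unsatisfying stage: blurry details, distorted structure, and a lack of depth. You eagerly type in "a transparent glass bottle," and the AI hands back a solid "lump of clay." You ask for a seaside town with swaying coconut palms, white sand, and blue sea, and you get a blurry sculpture that looks molded from modeling clay. You hope for a maple tree with light, graceful branches and leaves, but the AI cannot fully reproduce the natural form and three-dimensional structure of the foliage. All of these are...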

Stories

Advancing AI for the physical world 

January 21, 2026

Rho-alpha, which translates natural language commands into control signals for robotic systems doing bimanual manipulation tasks, aims to make physical systems more adaptable by using physical sensing modalities like touch and continuous learning from human feedback.

In the news | Microsoft Signal

Exploring what AI means for education and the next generation 

January 21, 2026

As AI transforms how we work, live and learn, higher education is more than another player — it needs to lead the way. Higher education must strike a balance as it prepares the next generation for a world being reshaped…
