Microsoft Research Blog

MMCTAgent: Enabling multimodal reasoning over large video and image collections 

November 12, 2025 | Akshay Nambi, Kavyansh Chourasia, and Tanuja Ganu

MMCTAgent enables dynamic multimodal reasoning with iterative planning and reflection. Built on Microsoft’s AutoGen framework, it integrates language, vision, and temporal understanding for complex tasks like long video and image analysis.

Stories

Advancing AI to meet needs of the global majority 

November 12, 2025

AI tools can perform poorly in non-Western languages and lack critical cultural context for many populations. Project Gecko uses small language models to bring vital expertise to farmers in underserved areas using local languages and multi-modal content.

Microsoft Research Blog

BlueCodeAgent: A blue teaming agent enabled by automated red teaming for CodeGen AI 

November 11, 2025 | Chengquan Guo, Yuzhou Nie, Chulin Xie, Zinan Lin, Wenbo Guo, and Bo Li

BlueCodeAgent is an end-to-end blue-teaming framework built to boost code security using automated red-teaming processes, data, and safety rules to guide LLMs’ defensive decisions. Dynamic testing reduces false positives in vulnerability detection.

In the news | Microsoft EMEA Blog

AI Diffusion Report: Mapping Global AI Adoption and Innovation 

November 11, 2025

The AI Economy Institute, Microsoft’s flagship think tank, released its AI Diffusion Report, offering a comprehensive view of where artificial intelligence is being used, developed, and built globally. The report provides country-level estimates of AI adoption, insights into innovation hubs, AI skills…

In the news | TED Talk

TED Talk – These AI devices protect nature in real time (Juan M. Lavista Ferres) 

November 10, 2025

If we can put astronauts on the moon, conservationists shouldn't have to hike miles through dense forests to change the batteries on cameras, says Juan M. Lavista Ferres, chief data scientist at the AI for Good Lab. He introduces SPARROW,…

In the news | Microsoft Unlocked

Project SPARROW – Guardians for the planet 

November 10, 2025

In northern central Colombia, Middle Magdalena Valley (MMV) is home to some of the world’s most remarkable biodiversity. Stretching between the central and eastern Andes mountains, this valley is crucial to the health of our planet. But the region—full of…

Awards | ACM SIGMICRO

Esha Choukse receives 2025 SIGMICRO Early Career Award 

November 7, 2025

Choukse was recognized for her foundational contributions to hardware memory compression and to sustainable and efficient datacenter systems.

Articles

Phi-Ground model: Teaching AI to “read the screen”

November 6, 2025

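Editor’s note: With the rapid development of multimodal and reasoning models, agents that can autonomously understand and operate computer interfaces (Computer Use Agents, CUA) are gradually becoming a reality. GUI grounding is the core step in achieving this capability: it determines whether an agent can accurately carry out specific operations such as clicking and typing. However, existing models still score low on key benchmarks and remain some distance from practical use. In response, Microsoft Research Asia recently released a technical report that systematically analyzes GUI Grou...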

Articles

Education in the AI Economy: the AI Economy Institute’s 2025 Fall Cohort 

November 6, 2025

The AI Economy Institute (AIEI) is launching its second cohort of researchers, advancing our mission to understand and accelerate the responsible diffusion of artificial intelligence (AI) across economies, industries, and communities. This year’s theme—Education in the AI Economy—places diffusion at…
