AI Frontiers - Microsoft Research: News And Awards

Microsoft Research
AI Frontiers

News & features

Articles

OmniParser for pure vision-based GUI agent

October 8, 2024

By Yadong Lu, Senior Researcher; Jianwei Yang, Principal Researcher; Yelong Shen, Principal Research Manager; Ahmed Awadallah, Partner Research Manager Recent advancements in large vision-language models (VLMs), such as GPT-4V and GPT-4o, have demonstrated considerable promise in driving intelligent agent systems…

Research Forum | Episode 4 - abstract chalkboard background with colorful network nodes and circular icons

Microsoft Research Blog

Microsoft Research Forum Episode 4: The future of multimodal models, a new “small” language model, and other AI updates

September 26, 2024

Explore multimodal & small language models, plus advanced benchmarks for AI evaluation. Microsoft researchers are working on breakthroughs in weather prediction, materials design, even a new kind of computer for AI inference and hard optimization problems.

A summary of insights extracted by using the Eureka framework, shown via two radar charts for multimodal (left) and language (right) capabilities respectively. The radar charts show the best and worst performance observed for each capability.

Microsoft Research Blog

Eureka: Evaluating and understanding progress in AI

September 17, 2024 | Vidhisha Balachandran, Jingya Chen, Neel Joshi, Besmira Nushi, Hamid Palangi, Eduardo Salinas, Vibhav Vineet, James Woffinden-Luey, and Safoora Yousefi

How can we rigorously evaluate and understand state-of-the-art progress in AI? Eureka is an open-source framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. Learn more about the extended findings.

Research Forum | Episode 4 Talk 2 | Corby Rosset

Articles

Direct Nash Optimization: Teaching language models to self-improve with general preferences

September 3, 2024

This talk discusses teaching language models to self-improve using a preference oracle like GPT-4, framing it as a two-player game to find an optimal policy at a Nash equilibrium, and achieving state-of-the-art win rates against GPT-4 Turbo on benchmarks such…

Stories

Research Forum Brief | September 2024

September 3, 2024

In this episode, learn about the latest multimodal AI models, advanced benchmarks for AI evaluation and model self-improvement, and an entirely new kind of computer for AI inference and hard optimization. Discover how these research breakthroughs and more can help…

white line icons on blue and green gradient background

Microsoft Research Blog

Tracing the path to self-adapting AI agents

July 25, 2024 | Ching-An Cheng, Adith Swaminathan, and Allen Nie

Introducing Trace, Microsoft and Stanford University’s novel AI optimization framework, now available as a Python library. Trace adapts dynamically and optimizes a wide range of applications from language models to robot control.

Microsoft Research Podcast

Abstracts: July 18, 2024

July 18, 2024 | Gretchen Huizinga and Arindam Mitra

Senior Researcher Arindam Mitra introduces AgentInstruct. Using raw data sources, the automated multi-agent framework can create diverse, high-quality synthetic data at scale for the post-training of small and large language models.

In the news | Microsoft News Center

Why AI sometimes gets it wrong — and big strides to address it

June 20, 2024

Around the time GPT-4 was making headlines for acing standardized tests, Microsoft researchers and collaborators were putting other AI models through a different type of test — one designed to make the models fabricate information.

AutoGen: White icons representing (from left to right) agents (multi), workflow, tasks, and coding on a blue to purple to pink gradient background.

Microsoft Research Blog

Introducing AutoGen Studio: A low-code interface for building multi-agent workflows

June 17, 2024 | Victor Dibia, Gagan Bansal, Jingya Chen, Suff Syed, Adam Fourney, Erkang (Eric) Zhu, Chi Wang, and Saleema Amershi

AutoGen Studio, built on Microsoft’s flexible open-source AutoGen framework for orchestrating AI agents, provides an intuitive user-friendly interface that enables developers to rapidly build, test, customize, and share multi-agent AI solutions—with little or no coding.

Microsoft Research AI Frontiers

News & features

Microsoft Research
AI Frontiers