BAPO – Bounded Attention Prefix Oracle
This repository contains all scripts for reproducing the results of our paper “Lost in Transmission: When and Why LLMs Fail to Reason Globally”.
PadChest-GR: A bilingual grounded radiology reporting benchmark for chest X-rays
The world’s first multimodal, bilingual radiology dataset could reshape the way radiologists and AI systems make sense of X-rays. PadChest-GR, developed by the University of Alicante with Microsoft Research, has the potential to advance research…
ExACT
ExACT is an approach for teaching AI agents to explore more effectively, enabling them to intelligently navigate their environments, gather valuable information, evaluate options, and identify optimal decision-making and planning strategies.
Microsoft Research Accurate Chemistry Collection (MSR-ACC)
The Skala functional will enable more accurate, scalable predictions in computational chemistry. It starts with the largest high-accuracy dataset ever built for training deep-learning-based density functional theory (DFT) models. This dataset underpins Skala—coming soon to…
AI Testing and Evaluation: Learnings from Science and Industry
In the introductory episode of this new series, host Kathleen Sullivan and Senior Director Amanda Craig Deckard explore Microsoft’s efforts to draw on the experience of other domains to help advance the role of AI…