Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies—available through Azure AI Foundry Labs (opens in new tab)—offer a glimpse into the future of AI innovation.
RAD-DINO model
RAD-DINO is a vision transformer model trained to encode chest X-rays using the self-supervised learning method DINOv2 (opens in new tab). RAD-DINO is described in detail in RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision (F.…
MAIRA-2 model
MAIRA-2 is a multimodal transformer designed for the generation of grounded or non-grounded radiology reports from chest X-rays. It is described in more detail in MAIRA-2: Grounded Radiology Report Generation (S. Bannur, K. Bouzid et al.,…
RadFact: An LLM-based Evaluation Metric for AI-generated Radiology Reporting
RadFact is a framework for the evaluation of model-generated radiology reports given a ground-truth report, with or without grounding. Leveraging the logical inference capabilities of large language models, RadFact is not a single number but a suite of…
Cheap Permutations
This repository replicates the experiments of the paper “Cheap Permutation Testing”.
KBLaM: Knowledge Base augmented Language Model
KBLaM is a new method for augmenting LLMs with external knowledge. Unlike Retrieval-Augmented Generation, KBLAM eliminates external retrieval modules, and unlike in-context learning, its computational overhead scales linearly with KB size rather than quadratically.
Magentic-One
Magentic-One is a generalist multi-agent system created to address intricate web and file-based tasks. By utilizing an intelligent Orchestrator alongside specialized agents, it facilitates the automation of complex, multi-step activities across various environments.
PIKE-RAG
PIKE-RAG (sPecIalized KnowledgE and Rationale Augmented Generation) framework mainly consists of several basic modules, including document parsing, knowledge extraction, knowledge storage, knowledge retrieval, knowledge organization, knowledge-centric reasoning, and task decomposition and coordination while building coherent…
PadChest-GR dataset
PadChest-GR is a manually annotated, bilingual chest X-ray dataset designed to train and evaluate models for grounded radiology report generation. It includes bounding boxes and comprehensive annotations of all clinically relevant findings.