Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies—available through Azure AI Foundry Labs (opens in new tab)—offer a glimpse into the future of AI innovation.
Chart Reader
Chart Reader is a web-based accessibility engine, which enables rendering of accessible visualizations for screen reader uses to read and better understand the visualizations and underlying data.
Analyzing PII Leakage
This repository contains the official code for our IEEE S&P 2023 paper using GPT-2 language models and Flair Named Entity Recognition (NER) models. It allows fine-tuning (i) undefended, (ii) differentially-private and (iii) scrubbed language models…
Syntheseus
Syntheseus is a package for retrosynthetic planning. It contains implementations of common search algorithms and a simple API to wrap custom reaction models and write custom algorithms. It is meant to allow for simple benchmarking…
Guidance
Guidance enables you to control modern language models more effectively and efficiently than traditional prompting or chaining. Guidance programs allow you to interleave generation, prompting, and logical control into a single continuous flow matching how…
HI-ML Multimodal Toolbox
HI-ML toolbox for deep learning for medical imaging and Azure integration. The Microsoft Health Intelligence Machine Learning Toolbox aims at providing low-level and high-level building blocks for Machine Learning / AI researchers and practitioners. It…
Temporal Vision-Language Processing (BioViL-T)
BioViL-T is a Vision-Language model trained on sequences of biomedical image and text data at a scale. It does not require manual annotations and can leverage historical raw clinical image acquisitions and clinical notes. The…
GA4GH TES on Azure
This project is an implementation of the GA4GH Task Execution Service (TES), that provides distributed batch task execution on Microsoft Azure. The TES API is an effort to define a standardized schema and API for describing…
SimpleRacerResearchPlatform
A simple racer research platform that showcases various imitation learning models with a web racing game.
Revizor: a fuzzer to search for microarchitectural leaks in CPUs
This is Revizor, a microarchitectural fuzzer. Instead of finding bugs in programs, Revizor searches for microarchitectural vulnerabilities in CPUs. What is a microarchitectural vulnerability? In the context of Revizor, it is a violation of out expectations…