Below is an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community. These experimental technologies, available through Azure AI Foundry Labs, offer a glimpse into the future of AI innovation.
Phlat (Prototype for Helpful Lookup and Tagging)
Phlat is a new interface for Windows Desktop Search, enabling search through a user’s own e-mail, files, and viewed Web pages. Phlat makes it easy for users to specify queries and filters, attempting to integrate…
pfnum: LaTeX Proof-Step Renumbering
This program is for use with the LaTeX pf and pf2 packages. It makes the symbolic labels of proof steps the same as the printed step numbers.
Pex – Automated Whitebox Testing for .NET (32 bit)
Pex (Program EXploration) is a white-box test generation tool. Given a hand-written parameterized unit test, Pex analyzes the code to determine relevant test inputs fully automatically. The result is a traditional unit test suite with…
mcBV
A satisfiability solver for (existential) bit-vector formulas based on the mcSAT framework.
MSR Abstractive Text Compression Dataset
This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality. The dataset is…
Question Sequences for Conversational Question Answering
The SQA dataset was created to explore the task of answering sequences of inter-related questions on HTML tables. It has 6,066 sequences with 17,553 questions in total.
Optical Data
This dataset is based on 14 months of optical data, from February 2015 to April 2016, taken from Microsoft’s optical backbone in North America. This backbone has O(50) optical cross-connects, O(100) WAN segments, and O(1000)…
U-SQL C# Analyzer
Static analysis of MSIL based on the analysis-net infrastructure.
Dafny
Dafny is a verification-aware programming language.