CCI (Common Compiler Infrastructure)
CCI provides a rich infrastructure for working with .NET assemblies, whether generating them from source or rewriting them.
Discover below an index of datasets, SDKs, APIs, and open-source tools developed by Microsoft researchers and shared with the global academic community. These experimental technologies, available through Azure AI Foundry Labs, offer a glimpse into the future of AI innovation.
Phlat is a new interface for Windows Desktop Search, enabling search through a user’s own e-mail, files, and viewed Web pages. Phlat makes it easy for users to specify queries and filters, attempting to integrate…
This program is for use with the LaTeX pf and pf2 packages. It makes the symbolic labels of proof steps the same as the printed step numbers.
Pex (Program EXploration) is a white-box test generation tool. Given a hand-written parameterized unit test, Pex analyzes the code to determine relevant test inputs fully automatically. The result is a traditional unit test suite with…
This dataset contains sentences and short paragraphs with corresponding shorter (compressed) versions. There are up to five compressions for each input text, together with quality judgements of their meaning preservation and grammaticality. The dataset is…
The SQA dataset was created to explore the task of answering sequences of inter-related questions on HTML tables. It has 6,066 sequences with 17,553 questions in total.
This dataset is based on 14 months of optical data, from February 2015 to April 2016, taken from Microsoft’s optical backbone in North America. This backbone has O(50) optical cross-connects, O(100) WAN segments, and O(1000)…