Discover below an index of datasets, SDKs, APIs, and open-source tools developed by Microsoft researchers and shared with the global academic community. These experimental technologies, available through Azure AI Foundry Labs, offer a glimpse into the future of AI innovation.
MSR FastRDFStore Package – Data Release
This data release is part of the MSR FastRDFStore Package (https://github.com/Microsoft/FastRDFStore/) and includes the last dump of Freebase, as well as the processed version ready to load directly into FastRDFStore.
Open Solving Library for ODEs
A C# library that implements algorithms for the numerical solution of ordinary differential equations. The library includes .NET desktop and Silverlight builds.
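As a rough illustration of the kind of algorithm such a library implements, the sketch below shows a classical fourth-order Runge-Kutta integrator. It is written in Python rather than C#, it is not the library's API, and the names `rk4_step` and `integrate` are hypothetical, chosen only for this example.

```python
import math

def rk4_step(f, t, y, h):
    """Advance y' = f(t, y) by one step of size h (classical RK4)."""
    k1 = f(t, y)
    k2 = f(t + h / 2, y + h / 2 * k1)
    k3 = f(t + h / 2, y + h / 2 * k2)
    k4 = f(t + h, y + h * k3)
    return y + h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)

def integrate(f, t0, y0, t_end, steps):
    """Integrate from t0 to t_end using a fixed number of RK4 steps."""
    h = (t_end - t0) / steps
    t, y = t0, y0
    for _ in range(steps):
        y = rk4_step(f, t, y, h)
        t += h
    return y

# Test problem: y' = -y with y(0) = 1, whose exact solution is exp(-t).
approx = integrate(lambda t, y: -y, 0.0, 1.0, 1.0, 100)
print(approx, math.exp(-1.0))
```

With 100 steps the result should agree closely with the exact value exp(-1); a production solver would typically add adaptive step-size control.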
Microsoft Hyperlapse Mobile
Microsoft Hyperlapse Mobile creates smooth and stabilized time lapses from first-person videos using a Windows Phone or Android device.
Microsoft Pix
With this new, intelligent camera app for your iPhone, you can enjoy life’s moments instead of struggling to capture them.
The Microsoft Cognitive Toolkit
The Microsoft Cognitive Toolkit empowers you to harness the intelligence within massive datasets through deep learning by providing uncompromised scaling, speed and accuracy with commercial-grade quality and compatibility with the programming languages and algorithms you…
Microsoft Research Sequential Question Answering (SQA) Dataset
Recent work in semantic parsing for question answering has focused on long and complicated questions, many of which would seem unnatural if asked in a normal conversation between two humans. In an effort to explore…
Visual Question Generation dataset
We introduce this dataset in order to support the novel task of Visual Question Generation (VQG), where, given an image, the system should ‘ask a natural and engaging question’. This dataset can be used to…
Catch the Whole Lot in an Action: Rapid Precise Packet Loss Notification in Data Center
Cutting Payload (CP) drops only the packet payload instead of the entire packet during buffer overload and uses a SACK-like precise ACK (PACK) technique to accurately inform senders of lost packets. The paper appeared at NSDI 2014.
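The idea of trimming payloads instead of dropping whole packets can be sketched as a toy model in Python. This is not the paper's implementation: the class `PayloadCuttingQueue`, the function `build_pack`, the 40-byte header size, and the dictionary-shaped PACK are all assumptions made for illustration, and the sketch assumes a trimmed header is always admitted.

```python
from dataclasses import dataclass

@dataclass
class Packet:
    seq: int
    payload: bytes
    trimmed: bool = False  # True once the payload has been cut

class PayloadCuttingQueue:
    """Toy switch queue: over the threshold, keep the header, cut the payload."""

    def __init__(self, threshold_bytes, header_bytes=40):
        self.threshold_bytes = threshold_bytes
        self.header_bytes = header_bytes  # assumed fixed header size
        self.used = 0
        self.packets = []

    def enqueue(self, pkt):
        size = self.header_bytes + len(pkt.payload)
        if self.used + size > self.threshold_bytes:
            # Overload: drop only the payload so the header still reaches
            # the receiver and signals exactly which packet was lost.
            pkt.payload = b""
            pkt.trimmed = True
            size = self.header_bytes  # simplification: headers always fit
        self.used += size
        self.packets.append(pkt)

def build_pack(packets):
    """Receiver side: report which sequence numbers arrived intact or trimmed."""
    return {
        "received": [p.seq for p in packets if not p.trimmed],
        "lost": [p.seq for p in packets if p.trimmed],
    }

queue = PayloadCuttingQueue(threshold_bytes=300)
for seq in range(4):
    queue.enqueue(Packet(seq=seq, payload=b"x" * 100))

pack = build_pack(queue.packets)
print(pack)
```

Because the trimmed headers still arrive, the receiver in this model can name the lost sequence numbers precisely, so the sender does not have to infer loss from timeouts or duplicate ACKs.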
NCI-PID-PubMed Genomics Knowledge Base Completion Dataset
This dataset includes a database of regulation relationships among genes and corresponding textual mentions of pairs of genes in PubMed article abstracts. It was derived from the NCI PID Pathway Interaction Database, and the textual…