Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies—available through Azure AI Foundry Labs (opens in new tab)—offer a glimpse into the future of AI innovation.
Microsoft Research Paraphrase Phrase Tables
This archive contains phrase tables generated by aligning the two paraphrase data sets described in Quirk, Brockett & Dolan (2004) and Dolan, Quirk & Brockett (2004). The alignments are bidirectional, created using the method described…
ESL 123 Mass Noun Examples
The ESL_123_MASS_NOUN dataset is a set of 123 sentences, found on the World Wide Web, that apparently were written by native speakers of languages spoken in China. Each sentence contains an example of at least…
Distributed Scheduling
This simulator provides four distributed algorithms which solve the random distributed log-based reconciliation problem, an important problem occurring in Computer Supported Cooperative Work. The problem is formalized using the Distributed Constraint Satisfaction paradigm. In the…