Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies—available through Azure AI Foundry Labs (opens in new tab)—offer a glimpse into the future of AI innovation.
Microsoft Research Paraphrase Phrase Tables
This archive contains phrase tables generated by aligning the two paraphrase data sets described in Quirk, Brockett & Dolan (2004) and Dolan, Quirk & Brockett (2004). The alignments are bidirectional, created using the method described…
ESL 123 Mass Noun Examples
The ESL_123_MASS_NOUN dataset is a set of 123 sentences, found on the World Wide Web, that apparently were written by native speakers of languages spoken in China. Each sentence contains an example of at least…