Project Dhaka: Single-cell dimensionality reduction
Python module applying deep learning to improve clustering and other analysis of single-cell genomic data (gene expression and copy number variation).
Below is an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community. These experimental technologies, available through Azure AI Foundry Labs, offer a glimpse into the future of AI innovation.
This is DLT, a dual learning toolkit developed by Microsoft Research. Dual learning leverages the structure duality among AI tasks (e.g., English-to-French translation vs. French-to-English translation, speech recognition vs. text to speech, and image classification…
Use consumer video equipment to trace animal movement. This is the source code of a project that was released in precompiled binary form in 2014: https://www.microsoft.com/en-us/download/details.aspx?id=52266
Detours is a software package for monitoring and instrumenting API calls on Windows. Detours has been used by many ISVs and is also used by product teams at Microsoft. Detours is now available under a…
MASS is a novel pre-training method for sequence-to-sequence language generation tasks. It randomly masks a sentence fragment in the encoder and then predicts that fragment in the decoder.
EconML is a Python package for estimating heterogeneous treatment effects from observational data via machine learning. This package was designed and built as part of the ALICE project at Microsoft Research with the goal to…
FHIR Server for Azure is an open-source implementation of the emerging HL7 Fast Healthcare Interoperability Resources (FHIR) specification designed for the Microsoft cloud. The FHIR specification defines how clinical health data can be made interoperable across systems,…
A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to characters.
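As a rough illustration of the masking step mentioned in the MASS entry above, the sketch below replaces one randomly placed contiguous fragment of a tokenized sentence with a mask symbol and returns that fragment as the prediction target. It is not taken from the MASS code base: the function name `mask_fragment`, the `[MASK]` symbol, and whitespace tokenization are all assumptions made for this example, and no model or training loop is shown.

```python
import random

MASK = "[MASK]"  # hypothetical mask symbol, chosen for illustration only


def mask_fragment(tokens, fragment_len, rng=random):
    """Mask one contiguous fragment of `tokens`.

    Returns (encoder_input, target, start): the sentence with the fragment
    replaced by MASK symbols, the fragment a decoder would be asked to
    predict, and the index where the fragment begins.
    """
    if not 0 < fragment_len <= len(tokens):
        raise ValueError("fragment_len must be between 1 and len(tokens)")
    # Pick a start position so the whole fragment fits inside the sentence.
    start = rng.randrange(len(tokens) - fragment_len + 1)
    end = start + fragment_len
    encoder_input = tokens[:start] + [MASK] * fragment_len + tokens[end:]
    target = tokens[start:end]
    return encoder_input, target, start


# Whitespace tokenization is a simplification for the example.
sentence = "the quick brown fox jumps over the lazy dog".split()
encoder_input, target, start = mask_fragment(sentence, 4, random.Random(0))
print(encoder_input)
print(target)
```

The encoder side sees the sentence with the fragment hidden, while the target holds exactly the hidden tokens, so substituting the target back at `start` recovers the original sentence.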