Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies—available through Azure AI Foundry Labs (opens in new tab)—offer a glimpse into the future of AI innovation.
AdversarialGMM
Code for replication of experiments in the paper: Minimax Estimation of Conditional Moment Models Nishanth Dikkala, Greg Lewis, Lester Mackey, and Vasilis Syrgkanis arXiv preprint arXiv:2006.07201 (2020)
MoveBox for Microsoft Rocketbox
MoveBox is a toolbox to animate the Microsoft Rocketbox avatars using motion captured (MoCap). Motion capture is performed using a single depth sensor, such as Azure Kinect or Windows Kinect V2. Our toolbox enables real-time…
Search4Code
Search4Code is a large-scale web query based dataset of code search queries for C# and Java. The Search4Code data is mined from Microsoft Bing’s anonymized search query logs using weak supervision technique.
Uncertainty-aware Self-training for Few-shot Text Classification (code)
Uncertainty-aware self-training (UST) for few-shot text classification with pre-trained language models. With only 20-30 labeled samples per class for each task, UST can perform similar to fully supervised pre-trained language models like BERT fine-tuned on…
REST API Fuzz Testing
This self-hosted service developed for Azure, including its orchestration engine and security tools (including MSR’s RESTler), enables developers to embed security tooling into their CI/CD workflows.
GitHub Publication Publication Publication Publication Publication
RESTler-Fuzzer
RESTler is the first stateful REST API fuzzing tool for automatically testing cloud services through their REST APIs and finding security and reliability bugs in these services.
GitHub Publication Publication Publication Publication Publication
Microsoft.Recognizers.Text
Microsoft.Recognizers.Text provides robust recognition and resolution of entities like numbers, units, and date/time; expressed in multiple languages. Full support for Chinese, English, French, Spanish, Portuguese, German, Italian, Turkish, and Hindi. Partial support for Dutch, Japanese,…