An index of datasets, SDKs, APIs and other open source code created by Microsoft researchers and shared with the broader academic community. We also maintain a collection highlighting some of the tools you’ll find here.
Scalable Language-Model-Building Tool
This scalable language-model tool is used to build language models from large amounts of data. It supports modified absolute discounting and Kneser-Ney smoothing. The tool has been used successfully to build a seven-gram language model…
Interactive Physical-Design Tuner
This is a .NET assembly with a PowerShell front end to enable interactive physical-design tuning sessions over SQL Server databases.
Guesstimate: A Programming Model for Collaborative Distributed Systems
Guesstimate is a programming model for developing collaborative distributed applications. The goal of Guesstimate is to provide a simple, object-oriented model for developing distributed-systems applications. The programming model is exposed as a C# API. The…
Generic Worker
The Generic Worker is a worker-role implementation for Windows Azure that eases deployment, instantiation, and remote invocation of existing .NET applications within Azure without changing their source code. The Generic Worker framework enables multiple .NET…