An index of datasets, SDKs, APIs, and other open-source code created by Microsoft researchers and shared with the broader academic community. We also maintain a collection highlighting some of the tools you’ll find here.
Microsoft Research Video Description Corpus
This dataset consists of about 120K sentences collected during the summer of 2010. Workers on Mechanical Turk were paid to watch a short video snippet and then summarize the action in a single sentence. The…
Photosynth Plug-in for Photoshop (32-bit and 64-bit)
An export plug-in for Adobe Photoshop that uploads panoramic images to the Photosynth web service.
Scalable Language-Model-Building Tool
This tool builds language models from large amounts of data. It supports modified absolute discounting and Kneser-Ney smoothing. The tool has been used successfully to build a seven-gram language model…
Interactive Physical-Design Tuner
A .NET assembly with a PowerShell front end that enables interactive physical-design tuning sessions over SQL Server databases.
Guesstimate: A Programming Model for Collaborative Distributed Systems
Guesstimate is a programming model for developing collaborative distributed applications. Its goal is to provide a simple, object-oriented model for building such applications. The programming model is exposed as a C# API. The…