Spreadsheet Intelligence for Ideas of Excel
Ideas in Excel aims at such one-click intelligence—when a user clicks the Ideas button on the Home tab of Excel, the intelligent service will empower the user to understand his or her data via automatic…
Immersive Storytelling with Data in Plain & Mixed Reality
Immersive visualization and storytelling techniques allow insights in complex data to be more effectively communicated to users.
The Perspective Engine: Making Numbers Easier to Understand
The Perspective Engine is an AI tool that automates the process of putting giant and unfamiliar numbers into familiar context, such as demonstrating the capacity of a CD-ROM with a stack of paper hundreds of…
Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations
Database admins construct secondary indexes on data tables to accelerate query processing in relational database management systems (RDBMSs). Unfortunately, maintaining multiple secondary indexes in the same database can be extremely space consuming, causing significant performance…
Fast Approximation of Empirical Entropy via Subsampling
The promotional video for the KDD 2019 paper: Fast Approximation of Empirical Entropy via Subsampling by Chi Wang and Bailu Ding.
Gen Encoder
Gen Encoder is a deep learning model that generically represents (embeds) search queries and questions based on their user intents. It leverages large scale user interaction data in Bing to learn this state-of-the-art intent-oriented representation.…
SILK: Preventing Latency Spikes in Log-Structured Merge Key-Value Stores
LSM-based KV stores are designed to offer good write performance by capturing client writes in memory and only later flushing them to storage. Writes are later compacted into a tree-like data structure on disk to…
Trill Samples
This repository hosts samples that demonstrate how to use Trill, a high-performance one-pass in-memory streaming analytics engine from Microsoft Research. It can handle both real-time and offline data, and is based on a temporal data…
DoWhy: Causal Reasoning for Designing and Evaluating Interventions
Today’s computing systems can be thought of as interventions in people’s work and daily lives. But what are the outcomes of these interventions, and how can we tune these systems for desired outcomes? In this…