Tea: A High-level Language and Runtime System for Automating Statistical Analysis
Current statistical tools place the burden of valid, reproducible statistical analyses on the user. Users must have deep knowledge of statistics to not only identify their research questions, hypotheses, and domain assumptions but also select…
Microsoft at SIGCOMM 2019
Microsoft is excited to be a Silver sponsor of this year’s ACM SIGCOMM (Special Interest Group on Data Communication), the flagship annual conference on the applications, technologies, architectures, and protocols for computer communication.
DIFF: A Relational Interface for Large-Scale Data Explanation
A range of explanation engines assist data analysts by performing feature selection over increasingly high-volume and high-dimensional data, grouping and highlighting commonalities among data points. While useful in diverse tasks such as user behavior analytics,…
Resource-Efficient Redundancy for Large-Scale Data Processing and Storage Systems
Large-scale systems are often subject to non-ideal conditions such as failures, stragglers, and load imbalance. These issues adversely affect query latency in data-processing systems, and durability and access latency in storage systems. Redundancy (duplication of…