Communicating Data to an Audience
Unsupervised Streaming Cyber-Analytics
The supervised learning approach to cyber-analytics has proven rather successful. However, there are challenges with this approach, including a frequent dearth of labelled data, the issue of temporal variation, and, fundamentally, problems of data volume…
Microsoft and Tsinghua University Work Together on Open Academic Data Research
In a recent collaboration, Microsoft and China’s Tsinghua University released an academic graph, named Open Academic Graph (OAG). This billion-scale academic graph integrates the current Microsoft Academic Graph (MAG) and Tsinghua’s AMiner academic graph. Specifically,…
Observing and Controlling Distributed Systems with Cross-Cutting Tools
Distributed systems represent some of the most interesting and successful computing applications in use today, from modern web applications and social networks to data analytics and large-scale machine learning. However, it is notoriously difficult to…
FASTER
FASTER is a new key-value store for point operations that combines a highly cache-optimized concurrent hash index with a novel self-tuning data organization. It extends the standard key-value store interface to handle read-modify-writes and blind…
Augmenting Visualization Tools with Automated Design & Recommendation
Visualization is a critical tool for data science. Analysts use plots to explore and understand distributions and relationships in their data. Machine learning developers also use diagrams to understand and communicate complex model structures. Yet…