Research Focus: Week of July 15, 2024
Advancing time series analysis with multi-granularity guided diffusion model; An algorithm-system co-design for fast, scalable MoE inference; What makes a search metric successful in large-scale settings; learning to solve PDEs without simulated data.
Proactive Resume and Pause of Resources for Microsoft Azure SQL Database Serverless
Demand-driven resource allocation for cloud databases has become a popular research direction. Recent approaches have evolved from reactive policies to proactive decision making. These approaches leverage not only the current resource demand but also the…
Intelligence Toolkit
The Intelligence Toolkit is a suite of interactive workflows for creating AI intelligence reports from real-world data sources. The toolkit is designed to help users identify patterns, answers, relationships, and risks within complex datasets, with…
Unified Database: Laying the foundation for large language model vertical applications
Unified databases offer better knowledge transfer between multimodal data types. They provide substantial corpus support for large language models and are poised to drive innovation in underlying hardware, laying the foundation for data-enhanced AI.
Research Focus: Week of June 10, 2024
In this issue: RELEVANCE automatically evaluates creative LLM responses; Recyclable vitrimer-based printed circuit boards; Lean Attention: Hardware-aware scalable attention mechanism; WaveCoder: a fine-tuned code LLM; New AutoGen training course.
SIBYL: A machine learning-based framework for forecasting dynamic workloads
SIBYL is a machine learning model that makes highly accurate predictions of database queries, enabling tuning for more efficiency. Applying traditional database optimizations to these predicted queries helps maintain high performance as demands change.
LST-Bench: A new benchmark tool for open table formats in the data lake
LST-Bench is a new open-source benchmark designed to evaluate table formats in cloud environments. It extends existing benchmarks to better reflect real-world usage & performance of data lakes and easily integrates with commonly used analytical…