Query Acceleration for Data Lakes
Accelerating query processing on open data formats As businesses become more data-driven, there is an increasing interest in adopting data lakes (e.g., Microsoft Fabric) in large enterprises. A data lake is a large storage repository…
Self-service Data Preparation
It is often cited that data scientists spend a significant portion of their time (up to 80%), cleaning and preparing data. For less-technical users, who may be less proficient in writing code (e.g., in Excel,…
FRA: Flexible Resource Allocation in Multi-Tenant Relational Database-as-a-Service
Oversubscription is an essential cost management strategy in multi-tenant, cloud Database-as-a-Service (DBaaS), and its importance is magnified by the emergence of serverless databases. In the FRA project, we have developed novel resource management techniques that…
VLDB 2023 Presentation for CORNET: Learning Table Formatting Rules By Example
Cornet: Learning Table Formatting Rules By Example. Mukul Singh, José Cambronero Sánchez, Sumit Gulwani, Vu Le, Carina Negreanu, Mohammad Raza, and Gust Verbruggen. Proc. VLDB Endow. 16, 10 (June 2023), 2632–2644. Link to PDF of…
STEAM: Observability-Preserving Trace Sampling
Understanding the user: How the Enterprise System Usability Scale aligns with user reality
This position research paper was presented at the 26th ACM Conference on Computer-Supported Cooperative Work and Social Computing (opens in new tab) (CSCW 2023), a premier venue for research on the design and use of…