I am a researcher at the Data Management, Exploration and Mining (DMX) group at Microsoft Research. I finished my PhD from University of Wisconsin-Madison with Prof. Jeffrey Naughton.
Recently I have been working on Self-service Data Preparation (opens in new tab), where we develop technologies to automate a variety of data-preparation tasks in the context of data science and business intelligence workflows.
Our research has been recognized with best paper awards at VLDB and SIGMOD. Some of our technologies have also been integrated into Microsoft products such as Power Query (opens in new tab) for Power BI (opens in new tab) (program synthesis, operator recommendations), Excel (opens in new tab) (error detection, data cleansing), Azure Machine Learning (opens in new tab) (data prep), and Azure Purview (opens in new tab) (auto-tagging in data lake).
Previously I worked on search engine query-log mining (Entity-Synonym (opens in new tab), Attribute-Synonym (opens in new tab), Acronym (opens in new tab), etc.), which are used in applications like Bing Snapp (opens in new tab) and Bing Knowledge Widget (opens in new tab).