
Chi Wang



I am a researcher in the Data Management, Exploration and Mining (DMX) group at Microsoft Research Redmond Lab.

I enjoy discovering principles and creating tools for large-scale data science and data analytics. My recent focus is on helping data scientists work efficiently with big data and accomplish their tasks faster. One general approach I have been studying is to use sampling techniques to generate approximate answers with theoretically guaranteed error bounds. I have developed sublinear solutions for a number of time-consuming tasks in data science and data analytics:

Data exploration
Outlier detection
Data aggregation (interactive visual analytics)

I am also interested in mining unstructured data, such as text and graphs; this line of work received the SIGKDD Data Science/Data Mining PhD Dissertation Award in 2015. My work in these areas can be found here.

I serve as a PC member for data mining, database, NLP, and machine learning conferences. I have also had the great pleasure of working with researchers in theory and HCI.






Scalable Data Science

Text Mining & Information Extraction

Social Network Analysis & Mining

Professional services

Conference program committee

  • ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) – 2018, 2016, 2015
  • International Conference on Very Large Data Bases (VLDB) – 2018, 2017
  • International Conference on Machine Learning (ICML) – 2019
  • International World Wide Web Conference (WWW) – 2018, 2017, 2016, 2015
  • International Conference on Web Search and Data Mining (WSDM) – 2019, 2018, 2016, 2015, 2014
  • Annual Meeting of the Association for Computational Linguistics (ACL) – 2017, 2015, 2013
  • Conference on Empirical Methods in Natural Language Processing (EMNLP) – 2018, 2015, 2013
  • Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT) – 2018
  • International Conference on Data Engineering (ICDE) – 2016
  • ACM Conference on Information and Knowledge Management (CIKM) – 2015, 2014
  • IEEE International Conference on Data Mining (ICDM) – 2015, 2014
  • IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) – 2016, 2015
  • International Joint Conference on Artificial Intelligence (IJCAI) – 2016, 2015, 2013

Journal reviewer

  • IEEE Transactions on Knowledge and Data Engineering (TKDE)
  • ACM Transactions on Knowledge Discovery from Data (TKDD)
  • The Proceedings of the VLDB Endowment (PVLDB)
  • ACM Transactions on Information Systems (TOIS)
  • IEEE Transactions on Big Data
  • Knowledge-Based Systems (KNOSYS)
  • ACM Transactions on Intelligent Systems and Technology (TIST)
  • Pattern Recognition Letters (PRLETTERS)
  • Social Network Analysis and Mining (SNAM)
  • Neurocomputing (NEUCOM)


Selected awards

  • OneML Windows 10 COIN Hackathon 2nd runner-up, 2017
  • SIGKDD Data Science/Data Mining PhD Dissertation Award, 2015
  • WSDM Outstanding Reviewer Award, 2015
  • Grand Prize in Yelp Dataset Challenge – Winner of Round Four, 2015
  • Microsoft Research Graduate Research Fellowship, 2011-2013
  • ACM SIGKDD Cup Data Mining Contest 2013 Track II Runner-up, 2013
  • Yahoo!-DAIS Research Excellence Award, 2014 & 2012
  • Champion in ACM International Collegiate Programming Contest – Mid-Central USA, 2009
  • National Olympiad in Informatics of China – Silver, 2005