About

I am a Researcher in the Data Management, Exploration and Mining (DMX) group at Microsoft Research. Before joining Microsoft, I completed my Ph.D. in Computer Science at the University of Illinois at Urbana-Champaign under the supervision of Prof. Jiawei Han.

I enjoy discovering principles and creating tools for large-scale data analysis and knowledge acquisition. In this information-overloaded world, I am especially interested in helping people efficiently conquer overwhelming data and information, by 1) revealing latent structures from unstructured or loosely structured data, 2) integrating, cleaning and reorganizing them, and 3) interactively exploring the data.

Projects

Concept Expansion

Established: November 10, 2014

Given a concept name and seed entities, return the entities and tables that belong to this concept.

Publications

2017

2016

2015

2014

2013

2012

2011

2010

2009

Other

Books and Conference Tutorials

Topic and Phrase Mining

Community, Role and Relationship Discovery

Data Integration, Cleaning and Filtering

Social Influence Analysis

Professional services

Conference program committee

  • ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) – 2016, 2015
  • International Conference on Very Large Data Bases (VLDB) – 2017
  • International World Wide Web Conference (WWW) – 2016, 2015, and Workshop on Natural Language Processing for Social Media
  • International Conference on Web Search and Data Mining (WSDM) – 2016, 2015, 2014 (Workshop on Diffusion Networks and Cascade Analytics)
    • WSDM 2015 Outstanding Reviewer Award
  • International Conference on Data Engineering (ICDE) – 2016
  • ACM Conference on Information and Knowledge Management (CIKM) – 2015, 2014
  • IEEE International Conference on Data Mining (ICDM) – 2015, 2014
  • IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) – 2016, 2015
  • Conference on Empirical Methods in Natural Language Processing (EMNLP) – 2015, 2013
  • Annual Meeting of the Association for Computational Linguistics (ACL) – 2015 (Workshop on Natural Language Processing for Social Media), 2013
  • International Joint Conference on Artificial Intelligence (IJCAI) – 2016, 2015 (Workshop on Social Influence Analysis), 2013 (Workshop on Heterogeneous Information Network Analysis)
  • International Conference on Advanced Data Mining and Applications (ADMA) – 2012, 2011

 Journal reviewer

  • IEEE Transactions on Knowledge and Data Engineering (TKDE)
  • ACM Transactions on Knowledge Discovery from Data (TKDD)
  • The Proceedings of the VLDB Endowment (PVLDB)
  • ACM Transactions on Information Systems (TOIS)
  • IEEE Transactions on Big Data
  • Knowledge-Based Systems (KNOSYS)
  • ACM Transactions on Intelligent Systems and Technology (TIST)
  • Pattern Recognition Letters (PRLETTERS)
  • Social Network Analysis and Mining (SNAM)
  • Neurocomputing (NEUCOM)
  • The Arabian Journal for Science and Engineering (AJSE)