I am a Researcher in the Data Management, Exploration and Mining (DMX) group at Microsoft Research. Before joining Microsoft, I completed my Ph.D. in Computer Science at University of Illinois at Urbana-Champaign under the supervision of Prof. Jiawei Han, my M.Phil. at The Chinese University of Hong Kong, advised by Jeffery Xu Yu, and my B.S. at Renmin University of China, advised by Shan Wang and Qing Zhu.
My research goals and interests span different aspects of large-scale data management, including querying and exploring “big” data, optimizing database systems, data mining algorithms and applications, and privacy-preserving data analytics. I am particularly interested in (randomized or approximation) algorithms which have performance guarantees in theory, and are effective and robust in practice as well. More recently, I am interested in:
- Searching and Exploring Big Data: a) processing analytical queries on large-scale data (e.g., with billions of rows) with approximate answers in interactive response time (e.g., one hundred milliseconds); b) inventing new search models and interfaces to help people explore structured/semi-structured (text) data more easily, and developing efficient algorithms and index structures to support them.
- Query Processing in Database Systems: a) faster algorithms for building block components (e.g., set intersection); b) progress estimation in query processing.
- Data Mining: developing data mining algorithms for various applications.
- Data Privacy
- Graphs in Databases
[VLDB 2016] Online Minimum Matching in Real-Time Spatial Data: Experiments and Analysis
Yongxin Tong, Jieying She, Bolin Ding, Lei Chen, Tianyu Wo, and Ke Xu
[VLDB 2016] Design of Policy-Aware Differentially Private Algorithms
Samuel Haney, Ashwin Machanavajjhala, and Bolin Ding
[SIGMOD 2016] Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee
Bolin Ding, Silu Huang, Surajit Chaudhuri, Kaushik Chakrabarti, and Chi Wang
[SIGMOD 2016] Quickr: Lazily Approximating Complex Ad-Hoc Queries in Big Data Clusters
Srikanth Kandula, Anil Shanbhag, Aleksandar Vitorovic, Matthaios Olma, Robert Grandl, Surajit Chaudhuri, and Bolin Ding
[SIGMOD 2016] Operator and Query Progress Estimation in Microsoft SQL Server Live Query Statistics
Kukjin Lee, Arnd Christian Konig, Vivek Narasayya, Bolin Ding, Surajit Chaudhuri, Brent Ellwein, Alexey Eksarevskiy, Manbeen Kohli, Jacob Wyant, Praneeta Prakash, Rimma Nehme, Jiexing Li, and Jeff Naughton
[ICDE 2016] Online Mobile Micro-Task Allocation in Spatial Crowdsourcing
Yongxin Tong, Jieying She, Bolin Ding, Libin Wang, and Lei Chen
[ICDCS 2016] Enabling Privacy-Preserving Incentives for Mobile Crowd Sensing Systems
Haiming Jin, Lu Su, Bolin Ding, Klara Nahrstedt, and Nikita Borisov
[SIGMOD 2015] S4: Top-k Spreadsheet-Style Search for Query Discovery
Fotis Psallidas, Bolin Ding, Kaushik Chakrabarti, and Surajit Chaudhuri
[VLDB 2015] Finding Patterns in a Knowledge Base using Keywords to Compose Table Answers
Mohan Yang, Bolin Ding, Surajit Chaudhuri, and Kaushik Chakrabarti
[KDD 2014] Scalable Near Real-Time Failure Localization of Data Center Networks
Herodotos Herodotou, Bolin Ding, Shobana Balakrishnan, Geoff Outhred, and Percy Fitter
[SIGMOD 2014] Discovering Queries based on Example Tuples
Yanyan Shen, Kaushik Chakrabarti, Surajit Chaudhuri, Bolin Ding, and Lev Novik
[SIGMOD 2014] Blowfish Privacy: Tuning Privacy-Utility Trade-offs using Policies
Xi He, Ashwin Machanavajjhala, and Bolin Ding
[VLDB 2013] Attraction and Avoidance Detection from Movements
Zhenhui Li, Bolin Ding, Fei Wu, Tobias Kin Hou Lei, Roland Kays, and Margaret C. Crofoot
[KDD 2013] EventCube: Multi-Dimensional Search and Mining of Structured and Text Data
Fangbao Tao, et al.
More in DBLP and Google Scholar