I am a research manager at Microsoft. I work on performance optimization of parallel and distributed systems, including machine learning systems, information retrieval systems, data management systems, and large-scale cloud infrastructure. My work has been selected among the best papers at SIGIR, ICDE, WSDM, and Middleware. My research results have been adopted by various Microsoft systems and products, such as Bing, Ads, AzureML, and AzureSQL, boosting system performance and capacity.
My recent work focuses on optimizing deep learning systems. I lead the DeepSpeed and DeepCPU projects, which strive for order(s)-of-magnitude improvements in speed and scale for DL training and inference.