Lei Zhang

Principal Research Manager


Lei Zhang is a principal researcher and research manager in Microsoft AI & Research, leading a team working on visual recognition and computer vision.

Prior to this, he worked at Microsoft Research Asia for 12 years as a senior researcher, leading a research team working on visual recognition, image analysis, and large-scale data mining. His years of work on large-scale, search-based image annotation have had considerable practical impact on multimedia search, including a highly scalable solution for duplicate image clustering across billions of images. From 2013 to 2015, he moved to Bing Multimedia Search as a principal development manager, helping develop cutting-edge solutions for web-scale image analysis and recognition problems, including image caption generation and high-precision image entity linking.

Lei is a senior IEEE member and a senior ACM member, and has served as an editorial board member for Multimedia System Journal and as a program co-chair, area chair, or committee member for many top conferences. He is the author or co-author of 100+ published papers in fields such as multimedia, computer vision, web search, and information retrieval, and holds 40+ U.S. patents for his innovations in these fields.

Lei earned all his degrees (B.E., M.E., and Ph.D.) in Computer Science from Tsinghua University, and currently also holds an adjunct professor position at Tianjin University.


MS-Celeb-1M: Challenge of Recognizing One Million Celebrities in the Real World

Established: June 29, 2016

MSR Image Recognition Challenge (IRC) @ACM Multimedia 2016
Important Dates/Updates:
New! We are hosting new challenges at ICCV 2017. Visit MsCeleb.org for more details.
Participant information disclosed in the "Team Information" section below.
6/21/2016: Evaluation result announced in the "Evaluation Result" section below.
6/17/2016: Evaluation finished. 14 teams finished the grand challenge!
6/13/2016: Evaluation started.
6/13/2016: Dry run finished, 14 out of 19 teams passed; see details in "Update Details" below.
6/10/2016: Dry run update 3: 8 teams…

MSRA-CFW: Data Set of Celebrity Faces on the Web

Established: May 20, 2012

Dataset Descriptions: The dataset includes image URLs for 202,792 faces. The labels of the faces are automatically generated by the algorithm in [1], with high accuracy. To facilitate downloading the images, we provide a number of URLs for the near-duplicates of each face. In addition, the thumbnail images and facial features (LBP) [2] are provided for visualization and benchmarking purposes. For copyright reasons, we do not provide the original web images. In the dataset, the files for…

Website Structure Understanding and its Applications

Established: August 12, 2010

Website structure understanding can be treated as a form of reverse engineering whose purpose is to automatically discover the layout templates and URL patterns of a website, and to understand how these templates and patterns are integrated to organize the website. The study of this problem has had a great impact on many applications that can leverage such site-level knowledge to help web search and data mining. Almost every website on the Internet has a distinct design &…

MindFinder: Finding Images by Sketching

Established: August 12, 2009

Sketch-based image search is a well-known and difficult problem, and little progress has been made in the past decade toward a large-scale, practical sketch-based search engine. We have revisited this problem and developed a scalable solution to sketch-based image search. The MindFinder system has been built by indexing more than 1.5 billion web images to enable efficient sketch-based image retrieval, and many creative applications can be expected to advance the state of…

Arista (lARge-scale Image Search To Annotation)

Established: January 1, 2006

Arista is a data-driven image annotation system that annotates an image based on large-scale image search. Its assumption is that closely similar images share similar semantics. It aims to be a practical image annotation engine that can automatically annotate images of any popular concept. Under development since 2006, Arista is now able to perform online tagging based on 2 billion web images, leveraging a near-duplicate detection technique. The Key Idea: The key hindering factor of computer…





Mining Text Snippets for Images on the Web
Anitha Kannan, Simon Baker, Krishnan Ramnath, Juliet Fiss, Dahua Lin, Lucy Vanderwende, Rizwan Ansary, Ashish Kapoor, Qifa Ke, Matt Uyttendaele, Xin-Jing Wang, Lei Zhang, in KDD '14 Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM Press, August 24, 2014





Interest Seam Image
Xiao Zhang, Gang Hua, Lei Zhang, Heung-Yeung Shum, Harry Shum, in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition, June 1, 2010