About

Xiaodong He (何晓冬) is a Senior Researcher in the Deep Learning Technology Center of Microsoft Research, Redmond, WA, USA. He is also an Affiliate Professor in the Department of Electrical Engineering at the University of Washington (Seattle), serving on PhD reading committees. His research interests are mainly in machine intelligence, including deep learning, natural language, vision, speech, information retrieval, and knowledge representation.

He has published in ACL, EMNLP, NAACL, CVPR, SIGIR, WWW, CIKM, NIPS, ICLR, ICASSP, Proc. IEEE, IEEE TASLP, IEEE SPM, and other venues. He has received several awards, including the Outstanding Paper Award of ACL 2015. He and his colleagues developed the MSR-NRC-SRI entry and the MSR entry that ranked No. 1 in the 2008 NIST Machine Translation Evaluation and the 2011 IWSLT Evaluation (Chinese-to-English), respectively, and the MSR image captioning system that won the 1st Prize at the MS COCO Captioning Challenge 2015. His work on image captioning was reported in Communications of the ACM in January 2016.

His current research focuses on deep learning for semantics and its applications to text, vision, information retrieval, and knowledge graphs. Relevant studies are summarized in the recent tutorial at CIKM 2014. More details are available at the DSSM site, which also offers the DSSM toolkit download.

He has held editorial positions on several IEEE journals, served as an area chair for NAACL-HLT 2015, and served on the organizing and program committees of major speech and language processing conferences. He is an elected member of the IEEE SLTC for the 2015-2017 term. He is a senior member of IEEE and a member of ACL.

He received the BS degree from Tsinghua University (Beijing) in 1996, the MS degree from the Chinese Academy of Sciences (Beijing) in 1999, and the PhD degree from the University of Missouri – Columbia in 2003.

Projects

MS-Celeb-1M: Challenge of Recognizing One Million Celebrities in the Real World

Established: June 29, 2016

MSR Image Recognition Challenge (IRC) @ACM Multimedia 2016. Latest updates: 6/21/2016: Evaluation result announced, see details in "Update Details" below. 6/17/2016: Evaluation finished; 14 teams finished the grand challenge! 6/13/2016: Evaluation started. 6/13/2016: Dry run finished, 14 out of 19…

From Captions to Visual Concepts and Back

Established: April 9, 2015

We introduce a novel approach for automatically generating image descriptions. Visual detectors, language models, and deep multimodal similarity models are learned directly from a dataset of image captions. Our system is state-of-the-art on the official Microsoft COCO benchmark, producing a…

DSSM

Established: January 30, 2015

Deep Structured Semantic Model / Deep Semantic Similarity Model. What is DSSM? DSSM stands for Deep Structured Semantic Model or, more generally, Deep Semantic Similarity Model. DSSM, developed by the MSR Deep Learning Technology Center (DLTC), is…

Publications

2016

Visual Storytelling
Ting-Hao (Kenneth) Huang, Francis Ferraro, Nasrin Mostafazadeh, Ishan Misra, Aishwarya Agrawal, Jacob Devlin, Ross Girshick, Xiaodong He, Pushmeet Kohli, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, Lucy Vanderwende, Michel Galley, Margaret Mitchell, in Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, ACL – Association for Computational Linguistics, April 1, 2016

2015

From Captions to Visual Concepts and Back
Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John Platt, Lawrence Zitnick, Geoffrey Zweig, in Proceedings of CVPR, IEEE – Institute of Electrical and Electronics Engineers, June 1, 2015

Projects

Deep Learning for Text Processing

Date

August 4, 2014

Speakers

Li Deng, Eric Xing, Xiaodong He, Jianfeng Gao, Christopher Manning, Paul Smolensky, and Jeff A Bilmes

Affiliation

MSR; Carnegie Mellon University; Microsoft Research, Redmond; MSR Redmond; Stanford; Johns Hopkins University; University of Washington

Other

Academic services

  • Member of the IEEE Speech and Language Processing Technical Committee 2015-2017
  • Area Chair, Spoken Language Processing, NAACL 2015
  • Associate Editor, IEEE Signal Processing Letters since 2014
  • Member of the Organizing Committee, Chair of Special Sessions, IEEE ICASSP 2013
  • Associate Editor, IEEE Signal Processing Magazine since 2012
  • Guest Editor, Special Issue on Continuous-space and related methods in natural language processing, in IEEE Transactions on Audio, Speech, and Language Processing, 2014
  • Guest Editor, Special Issue on Large-Scale Optimization for Audio, Speech, and Language Processing, in IEEE Transactions on Audio, Speech, and Language Processing, 2013
  • Lead Guest Editor, Special Issue on Statistical Learning Methods for Speech and Language Processing, in IEEE Journal of Selected Topics in Signal Processing, 2010
  • Co-Chair, NIPS 2008 Workshop on Speech and Language: Learning-Based Methods and Systems, Whistler, BC, Canada, 2008
  • Grant Reviewer: Swiss National Science Foundation
  • Program Committee Member: ACL, NAACL, EMNLP, COLING, AAAI
  • Reviewer: IEEE Transactions on Speech and Audio Processing, Proceedings of the IEEE, IEEE Signal Processing Magazine, IEEE Signal Processing Letters, IEEE Transactions on Computers, Speech Communication, Pattern Recognition, Pattern Recognition Letters, ICASSP, Interspeech, NIPS

Honors and awards

  • ACL 2015 Outstanding Paper Award
  • 1st Prize, MS COCO Captioning Challenge 2015
  • No. 1 Place, Chinese-to-English MT track, 2011 IWSLT Evaluation
  • No. 1 Place, Chinese-to-English common data track, 2008 NIST MT Evaluation
  • ICASSP 2011 Best Student Paper Award (co-author)
  • IEEE senior member since 2008
  • Microsoft Gold Star Award, 2005
  • Microsoft Patent awards, 2005-2014
  • Microsoft Technology Transfer Award, 2009, 2014

NIPS 2008 workshop

The NIPS 2008 workshop on Speech and Language: Learning-based Methods and Systems covers a variety of advanced topics in speech and language processing. More details can be found at the workshop's homepage, NIPS08 WSL.
