Jinyu Li

Partner Applied Science Manager

About

Jinyu Li received his Ph.D. degree in electrical and computer engineering from Georgia Institute of Technology, Atlanta, GA, USA in 2008. He joined Microsoft, Redmond, WA, USA in 2008 and now serves as Partner Applied Science Manager, leading a dynamic team dedicated to designing and enhancing speech modeling algorithms and technologies. Their aim is to ensure that Microsoft products maintain cutting-edge quality within the industry. His diverse research areas include end-to-end modeling for speech recognition and speech translation, deep learning, acoustic modeling, and noise robustness.

Dr. Li is an IEEE Fellow, for contributions to deep-learning-based speech technology innovation and commercialization. He is also an AAIA Fellow. He has been a member of IEEE Speech and Language Processing Technical Committee from 2017 to 2023. He also served as the associate editor of IEEE/ACM Transactions on Audio, Speech and Language Processing from 2015 to 2020. He was awarded as the Industrial Distinguished Leader at Asia-Pacific Signal and Information Processing Association (APSIPA) in 2021 and APSIPA Sadaoki Furui Prize Paper Award in 2023. He is named as Distinguished Industry Speakers for IEEE Signal Processing Society, 2025.

Latest CV is available here (opens in new tab).

Latest publication is available from my Google scholar page (opens in new tab).

What’s New

Jan. 2026: Elected as the vice chair of IEEE Speech and Language Technical Committee (opens in new tab).
Dec. 2025: wavLM (opens in new tab) paper received best paper award from IEEE SPS.
Mar. 2025: We released Phi-4-multimodal, an advanced model capable of processing inputs from speech, vision, and text. It provides exceptional performance for speech recognition, speech translation, speech QA, speech summarization, and audio understanding. Technical report (opens in new tab).
Mar. 2025: AAIA (opens in new tab) Fellow.
Jan. 2025: IEEE Fellow (opens in new tab), for contributions to deep-learning-based speech technology innovation and commercialization.
Jan. 2025: Distinguished Industry Speakers (opens in new tab) for IEEE Signal Processing Society. Look forward to working with IEEE SPS chapters to present the speech and multimodal topics in 2025 and 2026.
May. 2024: IEEE SPS webinar “End-to-End Automatic Speech Recognition”, Slides, Video. (opens in new tab)
Nov. 2023: It is my great honor to receive the APSIPA Sadaoki Furui Prize Paper Award for the paper “Recent Advances in End-to-End Automatic Speech Recognition (opens in new tab)” published in APSIPA Transactions on Signal and Information Processing, 2022. Here is the invited talk at APSIPA ASC 2023 for receiving the Award.
Dec. 2022: It was my pleasure to give a keynote talk, “Advancing end-to-end automatic speech recognition and beyond”, at International Symposium on Chinese Spoken Language Processing (ISCSLP). Slides, Video (opens in new tab).
Apr. 2022: The survey paper “Recent Advances in End-to-End Automatic Speech Recognition (opens in new tab)” is published in APSIPA Transactions on Signal and Information Processing.