I am a Researcher in the Speech and Dialog Research Group at Microsoft. My research has been evolving with the goal of creating computers that can recognize human speech regardless of acoustic environments. Specific areas I have been working in include blind source separation, blind dereverberation, microphone arrays, acoustic modeling, far-field speech recognition, and applications of deep neural networks to audio and speech processing.
Prior to joining Microsoft in 2016, I worked at NTT Communication Science Laboratories in Japan as a Research Scientist for ten years. I also conducted research at the University of Cambridge as a Visiting Scholar in 2013 and worked for Doshisha University as a Part-Time Lecturer in 2015. At NTT, I led an effort to develop its CHiME-3 far-field speech recognition system, which won the Challenge with a significant margin over other opponents. I also contributed to the development of NTT’s REVERB Challenge system, which ranked best in both single and multi-microphone categories. I developed several dereverberation algorithms with my colleagues, which are called a weighted prediction error (WPE) method.
I received the Best Paper Award Honorable Mention at ASRU 2015, the Awaya Prize Young Researcher Award and the Itakura Prize Innovative Young Researcher Award from the ASJ in 2010 and 2011, respectively, and the Young Researcher’s Award in Speech Field from the IEICE-ISS in 2011. I earned my Ph.D. in Informatics from Kyoto University, Japan, in 2010. I am a member of the IEEE and IPSJ.
Lists of my publications can be found at Google Scholar.