I am a researcher in the Speech and Dialog Research Group at Microsoft Research, working at Microsoft’s Silicon Valley Campus. My interests include speech recognition and understanding, language modeling, speaker recognition, language and dialect recognition, machine translation, historical linguistics, and software tools for computational linguistics. I am also an External Fellow at the International Computer Science Institute (ICSI) in Berkeley, where I lead a joint project with Microsoft.
Prior to 2011, I worked in the Speech Technology and Research Laboratory at SRI International, mainly on government-funded research projects in speech recognition, speaker recognition, and machine translation, such as the DARPA projects EARS, GALE, and RATS. While at SRI, I also collaborated with ICSI on recognition of multiparty meetings and other projects, and developed the open-source SRI Language Modeling Toolkit.
My Ph.D. research was in Computer Science at the University of California, Berkeley, on parsing and Bayesian learning of stochastic grammars.
- Papers written while at SRI (1994-2011): check here
- Papers written while at Berkeley (1988-1994): check here. My thesis work is summarized in the following three papers:
- Precise n-gram probabilities from stochastic context-free grammars, with J. Segal, Proc. ACL, 1994
- Inducing Probabilistic Grammars by Bayesian Model Merging, with S. Omohundro, Proc. ICGI, 1994
- An Efficient Probabilistic Context-Free Parsing Algorithm that Computes Prefix Probabilities, Computational Linguistics 21(2), 165-201, 1995
- Humans versus machines: the case of conversational speech recognition. Invited talk at Afeka Conference for Speech Processing, Tel Aviv, July 2017
- You Talking to Me? Speech-based and multimodal approaches for human versus computer addressee detection. Invited talk at Conference on Empirical Methods in Natural Language Processing, Austin, Texas, November 2016
- Meeting Recognition and Understanding – Past, Present, and a Guess at the Future. Invited talk at International Conference on Computational Processing of Portuguese, São Carlos, Brazil, October 2014
- Tutorials on speaker recognition presented at Winter School on Speech and Audio Processing, IIT Kanpur, January 2009:
- Human vs. Machine Transcription on DataSkeptic podcast, March 2019
- Editorial services for the journals Computional Linguistics, Computer Speech and Language, and the IEEE Transactions on Audio Speech and Language.
- Currently serving on the IEEE Speech and Language Processing Technical Committee (SLTC)
- Fellow of the IEEE and member of the IEEE Signal Processing Society
- Fellow of the International Speech Communication Association (ISCA)