I am a researcher at Microsoft Research India working in the areas of Machine Learning, Natural Language Systems and Applications, as well as Technology for Emerging Markets. My research interests lie broadly in the area of Speech and Language Technology especially in the use of linguistic models for building technology that offers a more natural Human-Computer as well as Computer-Mediated interactions.
I am currently working on Project Mélange where we try to understand, process and generate Code-mixed language data for both text and speech. Code-mixing or use of more than one languages in a single conversation or utterance is a phenomenon that is observed in all multilingual societies. Though Code-mixing has been studied in the past as a feature of conversational speech, the rapid rise of social-media and other online forums, has made it a common phenomenon for text as well. Conversational speech applications, like personal assistants as well as speech-to-speech translations, make it imperative that we know how to model this in speech as well.
Recently, I have become interested in how social and pragmatic functions affect language use, in code-mixed as well as monolingual conversations, and how to build effective computational models of sociolinguistics and pragmatics that can lead to more aware Artificial Intelligence.
I am also very passionate about NLP and Speech technology for Indian Languages. I believe that local language technology especially with speech interfaces, can help millions of people gain entry into a world that is till now almost inaccessible to them. I have served, and continue to serve, on several government and other committees that work on Indian Language Technologies as well as Linguistic Resources and Standards for NLP/Speech.