MunTTS: A Text-to-Speech System for Mundari
We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a low-resource Indian language of the Austro-Asiatic family. Our work addresses the gap in linguistic technology for underrepresented languages by collecting and processing data to…
Research Focus: Week of May 13, 2024
Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. Large language models (LLMs) have shown remarkable performance…
MSR Talk: Unsupervised Speech Reverberation Control with Diffusion Implicit Bridges
Speaker(s): Eloi Moliner; Host: Hannes Gamper. Speech reverberation control involves the manipulation of acoustic characteristics in speech recordings, including tasks like speech dereverberation or reverberation time reduction. Diffusion implicit bridges are a recently proposed domain translation…
End-to-End Automatic Speech Recognition
Large Language Models Cannot Explain Themselves
TREC Tip-of-the-Tongue Track
Tip-of-the-tongue (ToT) known-item retrieval is defined as “an item identification task in which the searcher has previously experienced an item but cannot recall a reliable identifier” (i.e., “It’s on the tip of my tongue…”). The…