
We have several Internship positions open in our Munich office, and we are looking for candidates with demonstrable experience in researching, developing, and characterizing new audio signal processing technologies. You will work with a multi-interdisciplinary…
In this issue: HyWay enables hybrid mingling; Auto-Tables transforms non-relational tables into standard relational forms; training dense retrievers to identify high-quality in-context examples for LLM; improving pronunciation assessment in CAPT.
We are looking for a Senior Researcher – Applied Science in the field of audio, speech natural language and/or computer vision with expertise in deep learning techniques to help our devices compute better understanding of…
Because headphones rank among the most popular wearables in the market, we have an exciting opportunity to expand their capabilities through integrating existing sensors with supplementary ones to enable a wide variety of experiences that…
Imagine an AI model that can seamlessly generate high-quality content across text, images, video, and audio, all at once. Such a model would more accurately capture the multimodal nature of the world and human comprehension,…
A neural codec language model for speech synthesis We introduce a language modeling approach for text-to-speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf…