岗位
Research Intern – Interactive Multimodal Futures Group (Situated & Affective Computing)
The Interactive Multimodal Futures …
岗位
Research Intern – Applied Sciences Group (Audio/Vision/NLP/Multimodal)
The Microsoft Applied Sciences Grou…
视频
Distant conversational speech recognition: Challenges and Opportunities
State-of-the-art ASR systems excel …
视频
FOA Tokenizer: Learning Discrete Representations of Spatial Audio with Multichannel VQ-GAN
Spatial audio captures the directio…