公開日 RESPOND: Responsive Engagement Strategy for Predictive Orchestration and Dialogue Meng-Chen Lee, Costas Panay, Javier Hernandez, Sean Andrist, Dan Bohus, Anatoly Churikov, Andrew D. Wilson March 2026 プロジェクト
公開日 Sirens’Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs Zijian Ling, Pingyi Hu, Xiuyong Gao, Xiaojing Ma, Man Zhou, Jun Feng, Songfeng Lu, Dongmei Zhang, Bin Benjamin Zhu March 2026
公開日 VibeVoice: Expressive Podcast Generation with Next-Token Diffusion Zhiliang Peng, Jianwei Yu, Wenhui Wang, Yaoyao Chang, Yutao Sun, Li Dong, Yi Zhu, Weijiang Xu, Hangbo Bao, Zehua Wang, Shaohan Huang, Yan Xia, Furu Wei ICLR 2026 | February 2026
公開日 Aurelius: Relation Aware Text-to-Audio Generation At Scale Yuhang He, He Liang, Yash Jain, Andrew Markham, Vibhav Vineet ICLR | February 2026
公開日 EmotionThinker: Prosody-Aware Reinforcement Learning for Explainable Speech Emotion Reasoning Dingdong Wang, Shujie Liu, Tianhua Zhang, Youjun Chen, Jinyu Li, Helen M. Meng ICLR 2026 | January 2026
公開日 SALAD-VAE: Semantic Audio Compression with Language-Audio Distillation Sebastian Braun, Hannes Gamper, Dimitra Emmanouilidou 2026 International Conference on Acoustics, Speech, and Signal Processing | January 2026
公開日 Towards Real-Time Generative Speech Restoration with Flow-Matching Tsun-An Hsieh, Sebastian Braun 2026 International Conference on Acoustics, Speech, and Signal Processing | January 2026 プロジェクト
公開日 Sci-Phi: A Large Language Model Spatial Audio Descriptor Xilin Jiang, Sebastian Braun, Hannes Gamper IEEE Open Journal of Signal Processing | January 2026 プロジェクト
キャリアの機会 Research Intern – Interactive Multimodal Futures Group (Situated & Affective Computing) Posted: December 2, 2025 場所: Cambridge, MA, US; Redmond, WA, US 研究分野: Artificial intelligence, Audio and Acoustics, Computer vision, Data platforms and analytics, Graphics and multimedia, Human-computer interaction The Interactive Multimodal Futures …
動画 Spatial Audio Rendering for Speech Live Translation 11月 24, 2025 | Margarita Geleta Language barriers in virtual meetin… 01:04:38