A Cross-modal Audio Search Engine based on Joint Audio-Text Embeddings
Ad-hoc audio clips, such as those from smart speakers, social media apps, security cameras and podcasts, are being recorded and shared online on a daily basis. For a variety of applications, it is important to…
Parametric Directional Coding for Precomputed Sound Propagation
Convincing audio for games and virtual reality requires modeling directional propagation effects. The initial sound’s arrival direction is particularly salient and derives from multiply-diffracted paths in complex scenes. When source and listener straddle occluders, the…
Deep exemplar-based colorization
Tech Showcase: OpenPAI: Open Source Initiative for AI Platform in China
Open platform for AI (Open PAI) is an open-source platform for GPU cluster management and resource scheduling. PAI provides runtime environment support, GPU scheduling, and supports debugging, log collection, and port management. PAI is designed…
Tech Showcase: Project Kinect for Azure depth sensor technology
We present a prototype of time-of-flight depth-sensing technology, which will be adopted in Project Kinect for Azure as well as in the next-generation of HoloLens. This depth sensor outperforms the current state-of-the-art in terms of…