Researchers in the Intelligent Multimedia (IM) Group are turning their ideas into reality in areas such as computer vision, image/video understanding, pattern recognition, machine learning, and cloud media. Projects are intended for the design of next generation intelligent image/video systems and pushing the state-of-the-art forward in multimedia related research. Current research directions include image/video analysis, deep learning, human understanding, scene understanding, etc.
Roles & Responsibilities
We are looking for a highly motivated intern to work on research projects on deep image/video analysis and understanding. You will work closely with researchers developing novel algorithms for image/video object detection, tracking, and segmentation, human pose and shape estimation, re-identification, and action recognition, video depth estimation, SLAM, scene understanding, and joint audio/video analysis, and writing papers aimed for top conferences and journals. The technologies you develop may also be shipped to Microsoft future products such as Microsoft Cognitive Services, Office Media, and Azure Media Analytics Services.
For this research project, we focus on the investigation/design of semi-supervised learning and the enhancement of domain generalization capability for computer vision tasks. In many practical tasks, there is usually a large performance gap between training and testing (e.g., due to domain gaps). Besides, human labeling is expensive and the amount of labeled data is limited. We will design algorithms/networks for effective domain generalization and domain adaptation.
- MS/PhD student in Computer Science, Software Engineering, Electrical Engineering or any related technical field
- Background in deep learning, computer vision, and image analysis
- Good programming skills or have related project experience
- Good communication skills and excellent teamwork
- With your advisor’s approval
Required Internship Duration:
Can commit at least 6-months internship