Vision and Language Intelligence
We are focusing on understanding, reasoning, and generation across language and vision, and creation of intelligent services, including vision-to-text captioning, text-to-vision generation, and question answering/dialog about images and videos.
Design Expo 2017: MIT, Cambridge
Each year, Microsoft Research sponsors a semester-long class at leading design schools. Students are asked to form interdisciplinary teams of two to four students to design a user experience prototype that solves a real-world problem.…
Understanding Black-box Predictions via Influence Functions
How can we explain the predictions of a black-box model? In this paper, we use influence functions — a classic technique from robust statistics — to trace a model’s prediction through the learning algorithm and back to…