Interactive Visual Media

Established: March 27, 2000

Overview

The Interactive Visual Media group conducts research on a range of topics in computer vision, computer graphics and computational photography. Our goal is to develop new applications for manipulating, reasoning about and communicating with visual media. Core areas of research include:

  • 3D reconstruction and image segmentation
  • object recognition
  • image and video enhancement

Each year we hire exceptional PhD students for summer internships. Offers are generally made from December through March for the following summer. For more information, please visit our intern webpage.

Other groups at Microsoft Research also conducting research in computer vision and computer graphics include:

Group Alumni

  • Anandan (MSR India)
  • Kentaro Toyama (MSR India)
  • Zhengyou Zhang (MSR Redmond)
  • Harry Shum (Live Search)
  • Antonio Criminisi (MSR Cambridge)
  • Sumit Basu (MSR Redmond)
  • Nebojsa Jojic (MSR Redmond)
  • Chuck Jacobs (MSR Redmond)
  • David Salesin (University of Washington and Adobe)
  • Steve Seitz (University of Washington)
  • Shai Avidan (Adobe)
  • Phil Torr (Oxford Brookes University)
  • Ying Shan (Microsoft)
  • Yaron Caspi (Weizmann)
  • Chris Pal (University of Rochester)
  • Matthew Brown (University of British Columbia)
  • Simon Baker (NVIDIA)
  • Piotr Dollar (Facebook)
  • Wolf Kienzle (MSR Redmond)

Downloads

MSR 3D Video Dataset

March 2014

This dataset includes a sequence of 100 images captured from 8 cameras, showing the breakdancing and ballet scenes from the paper “High-quality video view interpolation using a layered representation”, Zitnick et al., SIGGRAPH 2004. Depth maps computed from stereo are also included for each camera, along with the calibration parameters.

Size: 721 MB

HD View Utilities (32 bit)

April 2010

The HD View Utilities are a set of tools that help you create stunning HD View images. This package includes a Photoshop plug-in that enables you to export an HD View Web page or a cross-platform Silverlight Deep Zoom page from within Photoshop. Also included is hdmake, which is a simple command-line utility that provides the…

Size: 1 MB

HD View Utilities (64 bit)

April 2010

The HD View Utilities are a set of tools that help you create stunning HD View images. This package includes a Photoshop plug-in that enables you to export an HD View Web page or a cross-platform Silverlight Deep Zoom page from within Photoshop. Also included is hdmake, which is a simple command-line utility that provides the…

Size: 2 MB

HD View 3.3

July 2009

With HD View, you can view and interact with very large images on the Web. HD View supports smooth panning and zooming, as well as appropriate viewing perspectives as you zoom in and out. HD View also makes creating and hosting your own HD View images easy. HD View is available as a plug-in for…

Size: 1 MB

Videos

ShadowDraw

Date: March 4, 2011

Speakers: Larry Zitnick

e-Heritage Project

Date: May 2, 2008

Speakers: Katsushi Ikeuchi

Affiliation: The University of Tokyo

The Design Process

Date: February 13, 2007

Speakers: Brendan Dawes

Affiliation: MagneticNorth

Capturing People

Date: June 12, 2006

Speakers: Adrian Hilton

Affiliation: University of Surrey

Projects

Cliplets: Juxtaposing Still and Dynamic Imagery

Established: March 6, 2012

What Are Cliplets? Microsoft Research Cliplets is an interactive app that uses semi-automated methods to let users create "Cliplets", a type of imagery that sits between stills and video, from handheld videos. The app provides a creative lens one can use to focus on important aspects of a moment by mixing static and dynamic elements from a video clip. Please see the BLINK and BLINK Cliplets page for more information about the…

Core Tools for Augmented Reality

Established: April 28, 2009

We aim to enable people with mobile devices to receive continuously updated information about their surroundings by pointing a camera. The system uses image recognition to augment what a person sees on the screen with 2D or 3D graphics that track their environment in real time. We demonstrate this using a treasure hunt game that guides the user along a previously authored path indoors or…

Image Composite Editor

Established: December 3, 2008

What is Image Composite Editor? Image Composite Editor (ICE) is an advanced panoramic image stitcher created by the Microsoft Research Interactive Visual Media Group. Given a set of overlapping photographs of a scene shot from a single camera location, the app creates a high-resolution panorama that seamlessly combines the original images. ICE can also create a panorama from a panning video, including stop-motion action overlaid on the background. Finished panoramas can be shared with friends…

A Flexible New Technique for Camera Calibration

Established: December 2, 1999

We propose a flexible new technique to easily calibrate a camera. It is well suited for use without specialized knowledge of 3D geometry or computer vision. The technique only requires the camera to observe a planar pattern shown at a few (at least two) different orientations. Either the camera or the planar pattern can be freely moved. The motion need not be known. Radial lens distortion is modeled. The proposed procedure consists of a closed-form…
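
As a rough illustration of one building block commonly used when working with a planar pattern, the sketch below estimates the homography between points on the pattern plane and their observed image positions using the direct linear transform (DLT). This is not presented as the procedure from the project above, whose description is truncated here. The function names `estimate_homography` and `apply_homography` are hypothetical, NumPy is assumed to be available, and the sketch ignores lens distortion, noise handling and coordinate normalization.

```python
import numpy as np


def estimate_homography(plane_pts, image_pts):
    """Estimate the 3x3 homography mapping plane points to image points (DLT).

    Hypothetical helper for illustration only. Assumes at least four
    point pairs in general position and noise-free or low-noise data.
    """
    plane_pts = np.asarray(plane_pts, dtype=float)
    image_pts = np.asarray(image_pts, dtype=float)
    if plane_pts.shape != image_pts.shape or plane_pts.shape[0] < 4:
        raise ValueError("need at least four matching 2D point pairs")

    # Each correspondence (X, Y) -> (u, v) contributes two linear
    # constraints on the nine entries of H.
    rows = []
    for (X, Y), (u, v) in zip(plane_pts, image_pts):
        rows.append([-X, -Y, -1, 0, 0, 0, u * X, u * Y, u])
        rows.append([0, 0, 0, -X, -Y, -1, v * X, v * Y, v])
    A = np.array(rows)

    # The solution is the right singular vector belonging to the
    # smallest singular value of A.
    _, _, vt = np.linalg.svd(A)
    H = vt[-1].reshape(3, 3)

    # Fix the arbitrary scale (assumes H[2, 2] is not zero).
    return H / H[2, 2]


def apply_homography(H, pts):
    """Map 2D points through H and convert back from homogeneous coordinates."""
    pts = np.asarray(pts, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homog @ H.T
    return mapped[:, :2] / mapped[:, 2:3]
```

In use, one would pass the known coordinates of pattern features (for example, the corners of a printed grid) together with their detected pixel positions in one image, and repeat this for each orientation of the pattern.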