Learning to be a Depth Camera for Close-Range Human Capture and Interaction

August 11, 2014
Shahram Izadi | Microsoft Research

We present a machine learning technique for estimating absolute, per-pixel depth using any conventional monocular 2D camera, with minor hardware modifications. Our approach targets close-range human capture and interaction where dense 3D estimation of hands and faces is desired. We use hybrid classification-regression forests to learn how to map from near infrared intensity images to absolute, metric depth in real-time. We demonstrate a variety of human computer interaction and capture scenarios. Experiments show an accuracy that outperforms a conventional light fall-off baseline, and is comparable to high-quality consumer depth cameras, but with a dramatically reduced cost, power consumption, and form-factor.

- Antonio Criminisi
  
  Principal Researcher
- David Sweeney
  
  Principal Industrial Designer
- Jamie Shotton
  
  Partner Director of Science
- Pushmeet Kohli
  
  Principal Research Manager Director of Research Microsoft Research
- Sing Bing Kang
  
  Principal Researcher
- Sean Fanello
  
  Post Doc Researcher
- Tim Paek
  
  Principal Researcher, Research Manager
Research Area
- Artificial intelligence
Project
- Learning to be a depth camera for close-range human capture and interaction
Publication
- Learning to be a Depth Camera for Close-Range Human Capture and Interaction [Best Demo Honorable Mention Award]