[CV] [Google Scholar]

Sudipta Sinha is a researcher in the Interactive Media (IMG) Group at Microsoft Research Redmond. His research interests lie broadly in computer vision, robotics and computer graphics. He works on topics related to 3D scene reconstruction from images and video, including structure from motion, SLAM, visual odometry, stereo matching, optical flow, multi-view stereo, photometric stereo, image-based localization, and 6D object detection and pose estimation. He is interested in applications such as depth sensing, augmented reality (AR), and UAV-based aerial photogrammetry and 3D scanning.

He received his M.S. and Ph.D. from the University of North Carolina at Chapel Hill in 2005 and 2009, respectively, where he studied topics in geometric 3D computer vision. He was a member of the Urbanscape team that received the best demo award at CVPR 2007 for one of the first scalable, real-time, vision-based urban 3D reconstruction systems. He has served or will serve as an area chair for 3DV 2016, ICCV 2017 and 3DV 2018, and was a program co-chair for 3DV 2017. He also serves as an associate editor for the journal Computer Vision and Image Understanding (CVIU).




Fast Multi-frame Stereo Scene Flow with Motion Segmentation

CVPR 2017

www | pdf | extended-pdf | youtube

Flight Dynamics-based Recovery of a UAV Trajectory using Ground Cameras

CVPR 2017

www | pdf | dataset | supp | video


FarmBeats: An IoT Platform for Data-Driven Agriculture

NSDI 2017

www | pdf

Multiview Rectification of Folded Documents

TPAMI 2017

www | pdf


Efficient and Robust Color Consistency for Community Photo Collections


CVPR 2016

www | pdf | supplementary


Joint Recovery of Dense Correspondence and Cosegmentation in Two Images

CVPR 2016

www | pdf | supplementary | dataset


Monocular Localization of a moving person onboard a Quadrotor MAV

ICRA 2015

www | pdf | video2 | dataset


Calibrating a non-isotropic near point light source using a plane

CVPR 2014

www | pdf | supp

High-Resolution Stereo Matching using Local Plane Sweeps

CVPR 2014

www | pdf


3D Spin Movies and Photosynth 2

December 2013

MSR Blog | Techfest 2011 | Photosynth2-tutorial



Multiview Photometric Stereo using Planar Mesh Parameterization

ICCV 2013, TPAMI 2016

www | pdf | video2 | dataset


Leveraging Structure from Motion to Learn Discriminative Codebooks for Scalable Landmark Classification

CVPR 2013

www | pdf | sup


Detecting and Reconstructing 3D Mirror Symmetric Objects

ECCV 2012

www | pdf



Multiple View Object Cosegmentation using Appearance and Stereo Cues

ECCV 2012

www | pdf | supp | dataset



Real-time Image-based 6-DOF Localization in Large-Scale Environments

CVPR 2012

www | pdf | video | poster


Image-Based Rendering for Scenes with Reflections


www | pdf | video


Structure from motion for scenes with large duplicate structures

CVPR 2011

www | pdf | datasets


A linear approach to structure from motion

RMLE – ECCV workshop 2010

www | supplementary


Piecewise Planar Stereo for Image-based Rendering

ICCV 2009

www | pdf | video


Interactive 3D Architectural Modeling from Unordered Photo Collections


SIGGRAPH Asia 2008

www | pdf | supp



Organization, Service


Conference Program Committee

  • ACM Multimedia Conference 2016.
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2009–2017.
  • British Machine Vision Conference (BMVC) 2017.
  • 3D Processing, Visualization and Transmission (3DPVT 2012, 3DV 2013, 2014, 2015).
  • Asian Conference on Computer Vision (ACCV) 2009, 2010, 2012, 2014, 2016.
  • European Conference on Computer Vision (ECCV) 2008, 2010, 2012, 2014, 2016.
  • IEEE International Conference on Computer Vision (ICCV) 2009, 2011, 2013, 2015.
  • Indian Conference on Vision, Graphics and Image Processing (ICVGIP) 2010, 2012, 2014, 2016.

Workshop Program Committee

  • Workshop on Performance Metrics for Correspondence Problems (CVPR 2015).
  • Ground Truth – What is a good dataset? (CVPR 2013 Workshop).
  • Workshop on Unsolved Problems in Optic Flow and Stereo Estimation (ECCV 2012).
  • Consumer Depth Cameras for Computer Vision (ECCV 2012 Workshop).
  • Vision and Graphics Computing for Multimedia Communications (ICME 2011 Workshop).
  • Reconstruction and Modeling of Large-Scale 3D Virtual Environments (ECCV 2010 Workshop).
  • Computer Vision on GPUs (CV-GPU) (ECCV 2010 Workshop).
  • Dynamic 3D Imaging (DAGM 2009 Workshop).
  • Time of Flight Camera based Computer Vision (TOF-CV) (CVPR 2008 Workshop).


  • SIGGRAPH 2008–2017, SIGGRAPH Asia 2009–2012, 2016
  • EuroGraphics 2012, 2014, 2015, 2016
  • ICRA 2015, 2016
  • IROS 2016
  • ACM Transactions on Graphics (TOG)
  • International Journal of Computer Vision (IJCV)
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI)
  • IEEE Transactions on Visualization and Computer Graphics (TVCG)
  • IEEE Transactions on Computational Imaging (TCI)
  • IEEE Transactions on Multimedia (T-MM)
  • IEEE Transactions on Image Processing (TIP)
  • Computer Vision and Image Understanding (CVIU)
  • Journal of Visual Communication and Image Representation (JVCI)
  • Machine Vision and Applications (MVA)
  • Image and Vision Computing (IVC)
  • IEEE Pervasive Computing
  • Optics



  • CVPR 2017 Tutorial: Geometric and Semantic 3D Reconstruction
  • 3DV 2016 Tutorial: Semantic and Structured 3D Modeling

