Coherent Depth in Stereo Vision


August 17, 2011


Christian Richardt


University of Cambridge


In this talk, I will present my recent PhD work on coherent depth in stereo vision – both in computer vision and human vision.

The first half of the talk introduces a real-time stereo matching technique that incorporates temporal evidence in real time (≥14 fps). It is based on a per-frame technique inspired by a reformulation of adaptive support weights (Yoon & Kweon 2006), which achieves a 200 times speedup compared to a standard GPU implementation. The spatio-temporal technique visibly reduces flickering and outperforms per-frame approaches in the presence of image noise. To quantitatively evaluate the depth estimation from stereo video, we created five synthetic stereo videos with ground truth disparity maps.

In the second half talk, I will introduce a novel computational model for objectively assessing the visual comfort of stereoscopic 3D imagery. The model integrates research in visual perception with tools from stereo computer vision to quantify the degree of stereo coherence between both stereo half-images. The coherence scores computed by the model strongly correlate with human comfort ratings, as shown by a perceptual study. Based on these experiments, this talk further describes a taxonomy of stereo coherence issues which affect viewing comfort, and how they can be identified and localised in stereoscopic 3D images using computational tools.

This talk is based on the following papers:

Real-time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral Grid Christian Richardt, Douglas Orr, Ian Davies, Antonio Criminisi and Neil A.
European Conference on Computer Vision 2010 (poster + demo)

Predicting Stereoscopic Viewing Comfort Using a Coherence-Based Computational Model Christian Richardt, Lech Świrski, Ian Davies and Neil A. Dodgson Computational Aesthetics 2011, Vancouver, 5–7 August 2011


Christian Richardt

Christian Richardt is a PhD student in the Rainbow Group at the University of Cambridge Computer Laboratory. His research interests lie in visual computing, focusing on stereoscopic vision and graphics as well as non-photorealistic rendering (NPR). In his PhD, he investigates the role of coherent depth in NPR, as an input modality (think RGBZ video) and in terms of assessing stereoscopic viewing comfort. He has interned with Disney Research Zurich and MPI Informatik. Christian enjoys teaching and supervising student projects, which regularly receive project prizes. He is planning to submit in November 2011 and to graduate in March 2012.