Microsoft Research Blog

Artificial intelligence

Real-time RGB-D camera relocalization

December 23, 2013 | Ben Glocker, Shahram Izadi, Jamie Shotton, and Antonio Criminisi

We introduce an efficient camera relocalization approach which can be easily integrated into real-time 3D reconstruction methods, such as KinectFusion. Our approach makes use of compact encoding of whole image frames which enables both online harvesting of keyframes in tracking mode, and fast retrieval of…
Decision Jungles: Compact and Rich Models for Classification

December 5, 2013

Randomized decision trees and forests have a rich history in machine learning and have seen considerable success in application, perhaps particularly so for computer vision. However, they face a fundamental limitation: given enough data, the number of nodes in decision trees will grow exponentially with…
Finding Actors and Actions in Movies

December 1, 2013

We address the problem of learning a joint model of actors and actions in movies using weak supervision provided by scripts. Specifically, we extract actor/action pairs from the script and use them as constraints in a discriminative clustering framework. The corresponding optimization problem is formulated…
Efficient Human Pose Estimation from Single Depth Images

December 1, 2013

We describe two new approaches to human pose estimation. Both can quickly and accurately predict the 3D positions of body joints from a single depth image without using any temporal information. The key to both approaches is the use of a large, realistic, and highly…
Regression forests for efficient anatomy detection and localization in computed tomography scans

December 1, 2013

This paper proposes a new algorithm for the efficient, automatic detection and localization of multiple anatomical structures within three-dimensional computed tomography (CT) scans. Applications include selective retrieval of patient images from PACS systems, semantic visual navigation, and tracking radiation dose over time. The main contribution…
Whole-body anatomy localization via classification and regression forests

December 1, 2013 | Bjoern H. Menze, Georg Langs, Zhuowen Tu, and Antonio Criminisi

Behaviorist Agent Architecture

November 16, 2013 | Paulo Salem and Ana C. V. de Melo

In this article we propose an agent architecture based on Behavior Analysis, a behaviorist psychology theory. The main characteristic of this theory is the description of complex behavior exclusively in terms of stimulation and behavioral responses. That is to say, in terms of observable and…
Neighbourhood approximation using randomized forests [Best Paper Award]

October 1, 2013 | Ender Konukoglu, Ben Glocker, D. Zikic, and Antonio Criminisi

Leveraging available annotated data is an essential component of many modern methods for medical image analysis. In particular, approaches making use of the “neighbourhood” structure between images for this purpose have shown significant potential. Such techniques achieve high accuracy in analysing an image by propagating…
Vertebrae localization in pathological spine CT via dense classification from sparse annotations

September 22, 2013

Accurate localization and identification of vertebrae in spinal imaging is crucial for the clinical tasks of diagnosis, surgical planning, and post-operative assessment. The main difficulties for automatic methods arise from the frequent presence of abnormal spine curvature, small field of view, and image artifacts caused…
Modality propagation: coherent synthesis of subject-specific scans with data-driven regularization

September 22, 2013

We propose a general database-driven framework for coherent synthesis of subject-specific scans of desired modality, which adopts and generalizes the patch-based label propagation (LP) strategy. While modality synthesis has received increased attention lately, current methods are mainly tailored to specific applications. On the other hand,…
WESD – Weighted Spectral Distance for Measuring Shape Dissimilarity

September 1, 2013 | Ender Konukoglu, Ben Glocker, Antonio Criminisi, and K. M. Pohl

This paper presents a new distance for measuring shape dissimilarity between objects. Recent publications introduced the use of eigenvalues of the Laplace operator as compact shape descriptors. Here, we revisit the eigenvalues to define a proper distance, called Weighted Spectral Distance (WESD), for quantifying shape…
The any-combiner for multi-agent target classification

July 8, 2013 | Nathan Parrish and Ashley J. Llorens

The any-combiner is a classifier combination approach for target classification problems in which the target class can be naturally decomposed into multiple subclasses. This kind of classification problem can often occur in sensor-based system applications, such as biometric user verification, biosurveillance or underwater mine detection,…
