Microsoft Research Blog

Artificial intelligence

  1. Automatic Semantic Annotation and Validation of Anatomy in DICOM CT Images 

    February 1, 2011

    In the current health-care environment, the time available for physicians to browse patients’ scans is shrinking due to the rapid increase in the sheer number of images. This is further aggravated by mounting pressure to become more productive in the face of decreasing reimbursement. Hence,…

  2. Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard 

    January 30, 2011 | Anthony Vetro, Thomas Wiegand, and Gary J. Sullivan

    Significant improvements in video compression capability have been demonstrated with the introduction of the H.264/MPEG-4 advanced video coding (AVC) standard. Since developing this standard, the Joint Video Team of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG) has…

  3. A formal environment model for multi-agent systems 

    November 7, 2010 | Paulo Salem and Ana C. V. de Melo

    Multi-agent systems are employed to model complex systems which can be decomposed into several interacting pieces called agents. In such systems, agents exist, evolve and interact within an environment. In this paper we present a model for the specification of such environments. This Environment Model…

  4. Geodesic image and video editing 

    November 5, 2010 | Antonio Criminisi, Toby Sharp, Carsten Rother, and Patrick Perez

    This article presents a new, unified technique to perform general edge-sensitive editing operations on n-dimensional images and videos efficiently. The first contribution of the article is the introduction of a Generalized Geodesic Distance Transform (GGDT), based on soft masks. This provides a unified framework to address…

  5. Spatial decision forests for MS lesion segmentation in multi-channel MR images 

    September 20, 2010

    A new algorithm is presented for the automatic segmentation of Multiple Sclerosis (MS) lesions in 3D MR images. It builds on the discriminative random decision forest framework to provide a voxel-wise probabilistic classification of the volume. Our method uses multi-channel MR intensities (T1, T2, FLAIR),…

  6. Regression forests for efficient anatomy detection and localization in CT studies 

    September 20, 2010 | Antonio Criminisi, Jamie Shotton, Duncan Robertson, and Ender Konukoglu

    This paper proposes multi-class random regression forests as an algorithm for the efficient, automatic detection and localization of anatomical structures within three-dimensional CT scans. Regression forests are similar to the more popular classification forests, but are trained to predict continuous outputs. We introduce a new, continuous…

  7. Real-time spatiotemporal stereo matching using the dual-cross-bilateral grid 

    September 5, 2010

    We introduce a real-time stereo matching technique based on a reformulation of Yoon and Kweon's adaptive support weights algorithm [1]. Our implementation uses the bilateral grid to achieve a speedup of 200× compared to a straightforward full-kernel GPU implementation, making it the fastest technique on…

  8. Geodesic star convexity for interactive image segmentation 

    June 13, 2010

    In this paper we introduce a new shape constraint for interactive image segmentation. It is an extension of Veksler's [25] star-convexity prior, in two ways: from a single star to multiple stars and from Euclidean rays to Geodesic paths. Global minima of the energy function…