Microsoft Research Blog

Artificial intelligence

  1. Optimizing JPEG XR tile structure for fast local access 

    February 21, 2010 | Chengjie Tu, Gary J. Sullivan, and Sridhar Srinivasan

    Compressed digital image technology is fundamental to a variety of consumer electronics applications. Coded images are often accessed in part rather than in whole, by accessing spatial regions of interest (ROIs). In many application architectures, the decoding process to access each ROI may occur separately…

  2. Anatomy-Driven Medical Image Search 

    January 1, 2010 | K. Simonyan, A. Zisserman, and Antonio Criminisi

    This paper presents an algorithm for content based medical image search, which allows querying by a specific region of interest. The user provides a bounding box for the region in a query image. The algorithm then retrieves images similar in content to the query with…

  3. Automatic Semantic Segmentation of Anatomical Structures in CT Scans 

    January 1, 2010

    The growing use of MDCT scans has introduced the need for radiologists to read and interpret a huge number of images in their routine workflow. An efficient, fully automatic algorithm for anatomy segmentation is introduced, which promises to improve such workflow by enabling semantic navigation…

  4. Automatic Semantic Parsing of CT Scans via Multiple Randomized Decision Trees 

    December 1, 2009 | Antonio Criminisi, Jamie Shotton, Stefano Bucciarelli, and Khan M. Siddiqui

    We introduce a new, efficient algorithm for the automatic detection and localization of anatomical structures within 3D CT images. Our algorithm builds upon recent randomized decision tree classifiers and produces accurate posterior probabilities for each of the classes (e.g. organ labels) in the training set.…

  5. Epitomic Location Recognition 

    December 1, 2009 | Kai Ni, A. Kannan, Antonio Criminisi, and John Winn

    This paper presents a novel method for location recognition, which exploits an epitomic representation to achieve both high efficiency and good generalization. A generative model based on epitomic image analysis captures the appearance and geometric structure of an environment while allowing for variations due to…

  6. The JPEG XR image coding standard [Standards in a Nutshell] 

    October 22, 2009 | F. Dufaux, Gary J. Sullivan, and T. Ebrahimi

    JPEG XR is the newest image coding standard from the JPEG committee. It primarily targets the representation of continuous-tone still images such as photographic images and achieves high image quality, on par with JPEG 2000, while requiring low computational resources and storage capacity. Moreover, it…

  7. Piecewise planar stereo for image-based rendering 

    August 31, 2009 | Sudipta Sinha, Drew Steedly, and Rick Szeliski

    We present a novel multi-view stereo method designed for image-based rendering that generates piecewise planar depth maps from an unordered collection of photographs. First a discrete set of 3D plane candidates are computed based on a sparse point cloud of the scene (recovered by structure…

  8. A Structural Support Vector Method for Extracting Contexts and Answers of Questions from Online Forums (EMNLP Conference Paper) 

    August 6, 2009 | Wen-Yun Yang, Yunbo Cao, and Chin-Yew Lin

    This paper addresses the issue of extracting contexts and answers of questions from post discussion of online forums. We propose a novel and unified model by customizing the structural Support Vector Machine method. Our customization has several attractive properties: (1) it gives a comprehensive graphical…

  9. Semi-supervised Speech Act Recognition in Emails and Forums 

    August 6, 2009 | Minwoo Jeong, Chin-Yew Lin, and Gary Geunbae Lee

    In this paper, we present a semi-supervised method for automatic speech act recognition in email and forums. The major challenge of this task is due to lack of labeled data in these two genres. Our method leverages labeled data in the Switchboard-DAMSL and the Meeting…

  10. Efficient Inference of CRFs for Large-Scale Natural Language Data 

    August 4, 2009 | Minwoo Jeong, Chin-Yew Lin, and Gary Geunbae Lee

    This paper presents an efficient inference algorithm of conditional random fields (CRFs) for large-scale data. Our key idea is to decompose the output label state into an active set and an inactive set in which most unsupported transitions become a constant. Our method unifies two…