Microsoft Research Blog

Artificial intelligence

  1. Constructing Virtual Cities by Using Panoramic Images 

    July 31, 2004 | Katsushi Ikeuchi, Masao Sakauchi, Hiroshi Kawasaki, and Imari Sato

    Simultaneously acquired omni-directional images contain rays of 360 degree viewing directions. To take advantage of this unique characteristic, we have been developing several methods for constructing virtual cities. In this paper, we first describe a system to generate the appearance of a virtual citys the…

  2. ROUGE: a Package for Automatic Evaluation of Summaries 

    July 25, 2004 | Chin-Yew Lin

    ROUGE stands for Recall-Oriented Understudy for Gisting Evaluation. It includes measures to automatically determine the quality of a summary by comparing it to other (ideal) summaries created by humans. The measures count the number of overlapping units such as n-gram, word sequences, and word pairs…

  3. Gait recognition using image self-similarity 

    April 1, 2004 | Chiraz BenAbdelkader, Ross Cutler, and Larry S. Davis

    Gait is one of the few biometrics that can be measured at a distance, and is hence useful for passive surveillance as well as biometric applications. Gait recognition research is still at its infancy, however, and we have yet to solve the fundamental issue of…

  4. Stereo reconstruction from multiperspective panoramas 

    December 31, 2003 | Yin Li, Harry Shum, Chi-Keung Tang, and Rick Szeliski

    A new approach to computing a panoramic (360 degrees) depth map is presented in this paper. Our approach uses a large collection of images taken by a camera whose motion has been constrained to planar concentric circles. We resample regular perspective images to produce a…

  5. Gaze Manipulation for One-to-one Teleconferencing 

    October 13, 2003 | Antonio Criminisi, Jamie Shotton, Andrew Blake, and Philip H.S. Torr

    A new algorithm is proposed for novel view generation in one-to-one teleconferencing applications. Given the video streams acquired by two cameras placed on either side of a computer monitor, the proposed algorithm synthesises images from a virtual camera in arbitrary position (typically located within the…

  6. Cross-lingual C*ST*RD: English Access to Hindi Information 

    September 1, 2003

    We present C*ST*RD, a cross-language information delivery system that supports cross-language information retrieval, information space visualization and navigation, machine translation, and text summarization of single documents and clusters of documents. C*ST*RD was assembled and trained within 1 month, in the context of DARPA’s Surprise Language…

  7. Extraction of essential interactions through multiple observations of human demonstrations 

    July 27, 2003 | K. Ogawara, Jun Takamatsu, H. Kimura, and Katsushi Ikeuchi

    This paper describes a new approach on how to teach a robot everyday manipulation tasks under the "Learning from Observation" framework. In this approach, human demonstrations, which are made up of mutual interactions between a grasped object and an environmental object, are observed and a…