Microsoft Research Blog

Artificial intelligence

Constructing Virtual Cities by Using Panoramic Images

July 31, 2004 | Katsushi Ikeuchi, Masao Sakauchi, Hiroshi Kawasaki, and Imari Sato

Simultaneously acquired omni-directional images contain rays of 360 degree viewing directions. To take advantage of this unique characteristic, we have been developing several methods for constructing virtual cities. In this paper, we first describe a system to generate the appearance of a virtual citys the…
ROUGE: a Package for Automatic Evaluation of Summaries

July 25, 2004 | Chin-Yew Lin

ROUGE stands for Recall-Oriented Understudy for Gisting Evaluation. It includes measures to automatically determine the quality of a summary by comparing it to other (ideal) summaries created by humans. The measures count the number of overlapping units such as n-gram, word sequences, and word pairs…
The SPS algorithm: patching figural continuity and transparency by Split-Patch Search

July 19, 2004 | Antonio Criminisi and Andrew Blake

This paper describes a novel algorithm for the efficient synthesis of high-quality virtual views from only two input images. The emphasis is on the recovery of continuity of objects boundaries (figural continuity) with faithful synthesis of transparency effects. The contribution of this paper is two-fold:…
Looking for a Few Good Metrics: Automatic Summarization Evaluation – How Many Samples Are Enough?

June 2, 2004 | Chin-Yew Lin

ROUGE stands for Recall-Oriented Understudy for Gisting Evaluation. It includes measures to automatically determine the quality of a summary by comparing it to other (ideal) summaries created by humans. The measures count the number of overlapping units such as n-gram, word sequences, and word pairs…
Introduction to the Special Issue on Statistical Language Modeling

June 1, 2004 | Jianfeng Gao and Chin-Yew Lin

The goal of statistical language modeling (SLM) is to estimate the likelihood (or probability) of a word string. SLM is fundamental to many natural language applications like automatic speech recognition (ASR) [Jelinek 1990], statistical machine translation (SMT) [Brown et al. 1993], and Asian language text…
High-quality linear interpolation for demosaicing of Bayer-patterned color images

May 16, 2004 | Henrique S. Malvar, Li-wei He, and Ross Cutler

This paper introduces a new interpolation technique for demosaicing of color images produced by single-CCD digital cameras. We show that the proposed simple linear filter can lead to an improvement in PSNR of over 5.5 dB when compared to bilinear demosaicing, and about 0.7 dB…
Gait recognition using image self-similarity

April 1, 2004 | Chiraz BenAbdelkader, Ross Cutler, and Larry S. Davis

Gait is one of the few biometrics that can be measured at a distance, and is hence useful for passive surveillance as well as biometric applications. Gait recognition research is still at its infancy, however, and we have yet to solve the fundamental issue of…
Stereo reconstruction from multiperspective panoramas

December 31, 2003 | Yin Li, Harry Shum, Chi-Keung Tang, and Rick Szeliski

A new approach to computing a panoramic (360 degrees) depth map is presented in this paper. Our approach uses a large collection of images taken by a camera whose motion has been constrained to planar concentric circles. We resample regular perspective images to produce a…
Issues, Tasks and Program Structures to Roadmap Research in Question & Answering (Q&A)

October 24, 2003

Recently the Vision Statement to Guide Research in Question Answering (Q&A) and Text Summarization outlined a deliberately ambitious vision for research in Q&A. This vision is a challenge to the Roadmap Committee to define the program structures capable of addressing the question processing and answer…
Gaze Manipulation for One-to-one Teleconferencing

October 13, 2003 | Antonio Criminisi, Jamie Shotton, Andrew Blake, and Philip H.S. Torr

A new algorithm is proposed for novel view generation in one-to-one teleconferencing applications. Given the video streams acquired by two cameras placed on either side of a computer monitor, the proposed algorithm synthesises images from a virtual camera in arbitrary position (typically located within the…
Cross-lingual C*ST*RD: English Access to Hindi Information

September 1, 2003

We present C*ST*RD, a cross-language information delivery system that supports cross-language information retrieval, information space visualization and navigation, machine translation, and text summarization of single documents and clusters of documents. C*ST*RD was assembled and trained within 1 month, in the context of DARPA’s Surprise Language…
Extraction of essential interactions through multiple observations of human demonstrations

July 27, 2003 | K. Ogawara, Jun Takamatsu, H. Kimura, and Katsushi Ikeuchi

This paper describes a new approach on how to teach a robot everyday manipulation tasks under the "Learning from Observation" framework. In this approach, human demonstrations, which are made up of mutual interactions between a grasped object and an environmental object, are observed and a…

No results