Microsoft Research Blog

English

  1. GMConv: Modulating Effective Receptive Fields for Convolutional Kernels. 

    May 22, 2024

    In convolutional neural networks (CNNs), the convolutions are conventionally performed using a square kernel with a fixed N Ɨ N receptive field (RF). However, what matters most to the network is the effective receptive field (ERF), which indicates the extent to which input pixels contribute…

  2. A whole-slide foundation model for digital pathology from real-world data 

    May 22, 2024

    Digital pathology poses unique computational challenges, as a standard gigapixel slide may comprise tens of thousands of image tiles. Prior models have often resorted to subsampling a small portion of tiles for each slide, thus missing the important slide-level context. Here we present Prov-GigaPath, a…

  3. Small Language Models for Application Interactions: A Case Study 

    May 22, 2024

    We study the efficacy of Small Language Models (SLMs) in facilitating application usage through natural language interactions. Our focus here is on a particular internal application used in Microsoft for cloud supply chain fulfilment. Our experiments show that small models can outperform much larger ones…

  4. Cookie-doh 

    May 21, 2024 | Javier Zazo

    Cookie-doh is a repository template for creating single Python package projects.

  5. xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token 

    May 21, 2024

    This paper introduces xRAG, an innovative context compression method tailored for retrieval-augmented generation. xRAG reinterprets document embeddings in dense retrieval--traditionally used solely for retrieval--as features from the retrieval modality. By employing a modality fusion methodology, xRAG seamlessly integrates these embeddings into the language model representation…

  6. Microsoft Research Podcast - Abstracts | May 20, 2024 | Andrey Kolobov

    Abstracts: May 20, 2024 

    May 20, 2024 | Andrey Kolobov and Gretchen Huizinga

    Andrey Kolobov discusses WindSeer, a small CNN capable of estimating the wind field around an sUAV in flight more finely and with less compute and data than traditional models. The advancement can help support longer and safer autonomous flights.

  7. To Err Is Human, How about Medical Large Language Models? Comparing Pre-trained Language Models for Medical Assessment Errors and Reliability 

    May 20, 2024 | Wen-wai Yim, Yujuan Fu, Asma Ben Abacha, and Meliha Yetisgen

    Unpredictability, especially unpredictability with unknown error characteristics, is a highly undesirable trait, particularly in medical patient care applications. Although large pre-trained language models (LLM) have been applied to a variety of unseen tasks with highly competitive and successful results, their sensitivity to language inputs and…

  8. Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search 

    May 19, 2024 | Sebastian Bruch, Aditya Krishnan, and F. M. Nardini

    Clustering-based nearest neighbor search is an effective method in which points are partitioned into geometric shards to form an index, with only a few shards searched during query processing to find a set of top-$k$ vectors. Even though the search efficacy is heavily influenced by…