Microsoft Research Blog

English

  1. Observer Effect in Social Media Use 

    May 10, 2024

    While social media data is a valuable source for inferring human behavior, its in-practice utility hinges on extraneous factors. Notable is the ā€œobserver effect,ā€ where awareness of being monitored can alter people’s social media use. We present a causal-inference study to examine this phenomenon on…

  2. End-to-End Automatic Speech Recognition 

    May 10, 2024 | Jinyu Li

    The field of automatic speech recognition (ASR) is now dominated by the end-to-end (E2E) models that directly map speech to text. In this talk, we will give an overview of the E2E ASR models and introduce the recent progress from an industry perspective. To design…

  3. AI, Cognition and the Economy

    AICE – AI, Cognition, and the Economy | Spring 2024 Workshop 

    May 9, 2024

    This was an invite-only workshop. In recent years, large language models (LLMs) and artificial intelligence have made remarkable strides in a variety of tasks such as improving question answering, natural language understanding, and machine translation. These new technologies are now beginning to change the way…

  4. White ICLR logo to the left of the first page of the accepted paper, ā€œModel Tells You What to Discard: Adaptive KV Cache Compression for LLMsā€ on a purple background.

    LLM profiling guides KV cache optimization 

    May 8, 2024 | Liyuan Liu and Jianfeng Gao

    LLMs rely on memory-intensive mechanisms like the key-value (KV) cache to store and quickly retrieve data. FastGen optimizes KV cache usage, reducing LLM memory demands by up to 50% while maintaining performance.

  5. LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought 

    May 8, 2024

    Self-correction is emerging as a promising approach to mitigate the issue of hallucination in Large Language Models (LLMs). To facilitate effective self-correction, recent research has proposed mistake detection as its initial step. However, current literature suggests that LLMs often struggle with reliably identifying reasoning mistakes…

  6. MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures 

    May 8, 2024

    Accurate and fast prediction of materials properties is central to the digital transformation of materials design. However, the vast design space and diverse operating conditions pose significant challenges for accurately modeling arbitrary material candidates and forecasting their properties. We present MatterSim, a deep learning model…

  7. AI and the New Future of Work CFP program | New Future of Work ways we disconnect: illustration of concentric head outlines

    AI and the New Future of Work CFP | Spring 2024 

    May 7, 2024

    Language models are fundamentally changing how work gets done, and high-quality academic research is needed to ensure that the new future of work that they will help create is bright. Microsoft is soliciting proposals to fund research that will help shape the landscape of work…