Microsoft Research Blog

Artificial intelligence

  1. Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming 

    May 11, 2024 | Hussein Mozannar, Gagan Bansal, Adam Fourney, and Eric Horvitz

    Code-recommendation systems, such as Copilot and CodeWhisperer, have the potential to improve programmer productivity by suggesting and auto-completing code. However, to fully realize their potential, we must understand how programmers interact with these systems and identify ways to improve that interaction. To make progress, we…

  2. Improving Offline RL by Blending Heuristics 

    May 11, 2024 | Sinong Geng, Aldo Pacchiano, Andrey Kolobov, and Ching-An Cheng

    We propose Heuristic Blending (HUBL), a simple performance-improving technique for a broad class of offline RL algorithms based on value bootstrapping. HUBL modifies Bellman operators used in these algorithms, partially replacing the bootstrapped values with Monte-Carlo returns as heuristics. For trajectories with higher returns, HUBL…
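The "partial replacement" described in the HUBL summary can be pictured as a convex blend of the bootstrapped next-state value and the Monte-Carlo return. A minimal sketch of that idea (the function name, argument names, and the exact blending form are assumptions for illustration, not the paper's implementation):

```python
def hubl_target(reward, boot_value, mc_return, gamma=0.99, lam=0.5):
    """Blended Bellman target: partially replace the bootstrapped
    value with a Monte-Carlo return heuristic (illustrative sketch).

    lam=0.0 recovers the standard bootstrapped TD target;
    lam=1.0 uses the Monte-Carlo return alone.
    """
    # Convex combination of the two value estimates.
    blended = (1.0 - lam) * boot_value + lam * mc_return
    return reward + gamma * blended
```

Trajectories with higher Monte-Carlo returns contribute larger heuristic terms, which is consistent with the summary's note that HUBL's effect depends on the returns of the logged trajectories.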

  3. MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures 

    May 8, 2024

    Accurate and fast prediction of materials properties is central to the digital transformation of materials design. However, the vast design space and diverse operating conditions pose significant challenges for accurately modeling arbitrary material candidates and forecasting their properties. We present MatterSim, a deep learning model…

  4. Differentially Private Synthetic Data via Foundation Model APIs 1: Images 

    May 7, 2024

    Generating differentially private (DP) synthetic data that closely resembles the original private data without leaking sensitive user information is a scalable way to mitigate privacy concerns in the current data-driven world. In contrast to current practices that train customized models for this task, we aim…

  5. Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation 

    May 7, 2024

    This work aims to decrease the end-to-end generation latency of large language models (LLMs). A major cause of this latency is the sequential decoding approach adopted by almost all state-of-the-art LLMs. In this work, motivated by the thinking and writing process…
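The latency argument above comes from strictly sequential decoding; Skeleton-of-Thought's remedy is two-stage generation: first ask the model for a short skeleton of the answer, then expand the skeleton's points concurrently. A rough sketch of that flow, assuming a caller-supplied `llm` completion function (the prompts and helper names here are hypothetical, not the paper's):

```python
from concurrent.futures import ThreadPoolExecutor

def skeleton_of_thought(question, llm, max_workers=4):
    # Stage 1 (sequential): ask for a short skeleton of the answer.
    skeleton = llm(f"Outline 3-5 short bullet points answering: {question}")
    points = [p.strip("- ").strip() for p in skeleton.splitlines() if p.strip()]
    # Stage 2 (parallel): expand every point concurrently -- this is
    # where the end-to-end latency win comes from.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        expansions = list(pool.map(
            lambda p: llm(f"Question: {question}\nExpand this point: {p}"),
            points))
    return "\n\n".join(expansions)
```

With a batched or multi-request serving backend, the stage-2 calls overlap, so wall-clock time is governed by the longest single expansion rather than the sum of all of them.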

  6. Mixture-of-Linear-Experts for Long-term Time Series Forecasting 

    May 2, 2024 | Ronghao Ni, Zinan Lin, Shuaiqi Wang, and Giulia Fanti

    Long-term time series forecasting (LTSF) aims to predict future values of a time series given the past values. The current state-of-the-art (SOTA) on this problem is attained in some cases by linear-centric models, which primarily feature a linear mapping layer. However, due to their inherent…
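Though the abstract is cut off, the title suggests the shape of the architecture: several linear forecasting heads whose predictions are mixed by a learned router. A toy sketch under that assumption (the function, the router's softmax form, and its timestamp-feature input are all guesses for illustration):

```python
import math

def mole_forecast(history, experts, router, time_feat):
    """Toy Mixture-of-Linear-Experts forecast (illustrative only).

    history:   list of past values, length L
    experts:   list of H-by-L weight matrices, one linear head each
    router:    one logit-weight row per expert, applied to time_feat
    time_feat: feature vector describing the forecast timestamp
    """
    # Router: softmax over one logit per expert.
    logits = [sum(w * f for w, f in zip(row, time_feat)) for row in router]
    peak = max(logits)
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Mix the experts' linear forecasts with the router weights.
    horizon = len(experts[0])
    forecast = [0.0] * horizon
    for wgt, expert in zip(weights, experts):
        for h, row in enumerate(expert):
            forecast[h] += wgt * sum(c * x for c, x in zip(row, history))
    return forecast
```

Each head stays a plain linear map from the lookback window to the horizon, matching the "linear-centric" framing in the summary; only the mixing weights vary with the input.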