Microsoft Research Blog

Artificial intelligence

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

December 11, 2023

We introduce a new benchmark, LLF-Bench (Learning from Language Feedback Benchmark; pronounced as “elf-bench”), to evaluate the ability of AI agents to interactively learn from natural language feedback and instructions. Learning from language feedback (LLF) is essential for people, largely because the rich information this…
Chanakya: Learning Runtime Decisions for Adaptive Real-Time Perception

December 10, 2023

Real-time perception requires planned resource utilization. Computational planning in real-time perception is governed by two considerations – accuracy and latency. There exist run-time decisions (e.g. choice of input resolution) that induce tradeoffs affecting performance on a given hardware, arising from intrinsic (content, e.g. scene clutter)…
DiSK: A Diffusion Model for Structured Knowledge

December 8, 2023 | Ouail Kitouni, Niklas Nolte, James Hensman, and Bhaskar Mitra

Structured (dictionary-like) data presents challenges for left-to-right language models, as they can struggle with structured entities for a wide variety of reasons such as formatting and sensitivity to the order in which attributes are presented. Tabular generative models suffer from a different set of limitations…
Axiomatic Preference Modeling for Longform Question Answering

December 6, 2023

The remarkable abilities of large language models (LLMs) like GPT-4 partially stem from post-training processes like Reinforcement Learning from Human Feedback (RLHF) involving human preferences encoded in a reward model. However, these reward models (RMs) often lack direct knowledge of why, or under…
MatterGen: a generative model for inorganic materials design

December 6, 2023

The design of functional materials with desired properties is essential in driving technological advances in areas like energy storage, catalysis, and carbon capture. Generative models provide a new paradigm for materials design by directly generating entirely novel materials given desired property constraints. Despite recent progress,…
Early LLM-based Tools for Enterprise Information Workers Likely Provide Meaningful Boosts to Productivity

December 4, 2023

This report presents the initial findings of Microsoft’s research initiative on “AI and Productivity”, which seeks to measure and accelerate the productivity gains created by LLM-powered productivity tools like Microsoft’s Copilot. The many studies summarized in this report, the initiative’s first, focus on common enterprise…
Promoting Topic Coherence and Inter-Document Consorts in Multi-Document Summarization via Simplicial Complex and Sheaf Graph

December 1, 2023 | Yash Kumar Atri, Arun Iyer, Tanmoy Chakraborty, and Vikram Goyal

Multi-document Summarization (MDS) characterizes compressing information from multiple source documents to its succinct summary. An ideal summary should encompass all topics and accurately model cross-document relations expounded upon in the source documents. However, existing systems either impose constraints on the length of tokens during the…
TaskWeaver: A Code-First Agent Framework

December 1, 2023

Large Language Models (LLMs) have shown impressive abilities in natural language understanding and generation, leading to their use in applications such as chatbots and virtual assistants. However, existing LLM frameworks face limitations in handling domain-specific data analytics tasks with rich data structures. Moreover, they struggle…
Training Private and Efficient Language Models with Synthetic Data from LLMs

December 1, 2023

Language models are pivotal in modern text-based applications, offering many productivity features like next-word prediction, smart composition, and summarization. In many applications, these models must be lightweight to meet inference time and computational cost requirements. Furthermore, due to the inherent sensitivity of their training data,…
MM-Reasoner: A Multi-Modal Knowledge-Aware Framework for Knowledge-Based Visual Question Answering

December 1, 2023 | Mahmoud Khademi, Ziyi Yang, Felipe Vieira Frujeri, and Chenguang Zhu

Thanks to the strong reasoning capabilities of Large Language Models (LLMs), recent approaches to knowledge-based visual question answering (KVQA) utilize LLMs with a global caption of an input image to answer a question. However, these approaches may miss key visual information that is not captured…
Differentially Private Approximate Near Neighbor Counting in High Dimensions

December 1, 2023 | Alexandr Andoni, Piotr Indyk, Sepideh Mahabadi, and Shyam Narayanan

Range counting (e.g., counting the number of data points falling into a given query ball) under differential privacy has been studied extensively. However, the current algorithms for this problem are subject to the following dichotomy. One class of algorithms suffers from an additive error that…
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

November 28, 2023

Generalist foundation models such as GPT-4 have displayed surprising capabilities in a wide variety of domains and tasks. Yet, there is a prevalent assumption that they cannot match specialist capabilities without intensive training of models with specialty knowledge. For example, most explorations to date on…

No results