Microsoft Research Blog

English

  1. Algorithm Generation via Creative Ideation 

    April 23, 2026 | Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, and Francis Y. Yan

    Designing system algorithms remains challenging, where the discontinuous nature of the solution space often forces system engineers to rely on generic heuristics at the expense of performance. We study whether LLMs can practically drive algorithm generation, and find that they are biased towards well-known generic…

  2. VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL 

    April 23, 2026

    With the rapid advancement of AI-generated videos, there is an urgent need for effective detection tools to mitigate societal risks such as misinformation and reputational harm. In addition to accurate classification, it is essential that detection models provide interpretable explanations to ensure transparency for regulators…

  3. background pattern

    ICLR 2026 

    April 23, 2026

    Microsoft is proud to have over 150 accepted papers and be a sponsor of the 14th International Conference on Learning Representations (ICLR) (opens in new tab) occurring April 23 - 27, 2026 in Rio de Janeiro, Brazil. ICLR is the premier gathering of professionals dedicated…

  4. Parallel Sampling from Masked Diffusion Models via Conditional Independence Testing 

    April 23, 2026

    Masked diffusion models (MDMs) offer a compelling alternative to autoregressive models (ARMs) for discrete text generation because they enable parallel token sampling, rather than sequential, left-to-right generation. This means potentially much faster inference. However, effective parallel sampling faces two competing requirements: (i) simultaneously updated tokens…

  5. Continuous Benchmark Generation for Evaluating Enterprise-scale LLM Agents 

    April 13, 2026

    The rapid adoption of AI agents across domains has made systematic evaluation crucial for ensuring their usefulness and successful production deployment. Evaluation of AI agents typically involves using a fixed set of benchmarks and computing multiple evaluation metrics for the agent. While sufficient for simple…

  6. Storycaster: An AI System for Immersive Room-Based Storytelling 

    April 13, 2026 | Naisha Agarwal, Judith Amores, and Andrew D. Wilson

    While Cave Automatic Virtual Environment (CAVE) systems have long enabled room-scale virtual reality and various kinds of interactivity, their content has largely remained predetermined. We present Storycaster, a generative AI CAVE system that transforms physical rooms into responsive storytelling environments. Unlike headset-based VR, Storycaster preserves…

  7. The SURE Framework: Social Intelligence for Human-Agent Collaboration 

    April 13, 2026

    Large Language Model agents are evolving from question-answering tools to genuine collaborators that run continuously, reason across turns, reflect on tool usage, and act on our behalf. Yet effective human-agent collaboration requires more than raw capability. It demands social intelligence: the ability to sense, understand,…