Microsoft Research Blog

English

  1. Skala 

    Skala is a neural network-based exchange-correlation functional for density functional theory (DFT), developed by Microsoft Research AI for Science. It leverages deep learning to predict exchange-correlation energies from electron density features, achieving chemical accuracy for atomization energies and strong performance on broad thermochemistry and kinetics…

  2. Improving Language Agents Through BREW 

    September 29, 2025

    Large Language Model (LLM)-based agents are increasingly applied to tasks requiring structured reasoning, tool use, and environmental adaptation, such as data manipulation, multistep planning, and computer-use automation. However, despite their versatility, current training paradigms for model weight optimization methods, like PPO and GRPO, remain relatively…

  3. PerfBench: Can Agents Resolve Real-World Performance Bugs? 

    September 27, 2025 | Spandan Garg, Roshanak Zilouchian Moghaddam, and Neel Sundaresan

    Performance bugs are inefficiencies in software that waste computational resources without causing functional failures, making them particularly challenging to detect and fix. While recent advances in Software Engineering agents have shown promise in automated bug fixing, existing benchmarks primarily focus on functional correctness and fail…

  4. graphical user interface, application, website

    Understanding How Users Prepare for and React to Smartphone Theft 

    September 24, 2025 | Divyanshu Bhardwaj

    Smartphone theft is common, yet little research explores how users prepare for or respond to such incidents. To address this gap in the literature, we conducted 20 semi-structured interviews with victims who had experienced smartphone theft in the past two years. These cases ranged from…

  5. graphical user interface, website

    When LLMs Go Online: The Emerging Threat of Web-Enabled LLMs 

    September 24, 2025 | Hanna Kim

    Recent advancements in Large Language Models (LLMs) have established them as agentic systems capable of planning and interacting with various tools. These LLM agents are often paired with web-based tools, enabling access to diverse sources and real-time information. Although these advancements offer significant benefits across…

  6. A Formal Analysis of Apple’s iMessage PQ3 Protocol 

    September 24, 2025 | Felix Linker

    We present the formal verification of Apple's iMessage PQ3, a highly performant, device-to-device messaging protocol offering strong security guarantees even against an adversary with quantum computing capabilities. PQ3 leverages Apple's identity services together with a custom, post-quantum secure initialization phase and afterwards it employs a…