Microsoft Research Blog

  1. Research Focus: Week of August 26, 2024

    August 28, 2024

    Learn what’s next for AI at Research Forum on Sept. 3; WizardArena simulates human-annotated chatbot games; MInference speeds pre-filling for long-context LLMs via dynamic sparse attention; Reef: Fast succinct non-interactive zero-knowledge regex proofs.

  2. WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback 

    August 28, 2024

    As large language models (LLMs) continue to advance, aligning these models with human preferences has emerged as a critical challenge. Traditional alignment methods, relying on human or LLM annotated datasets, are limited by their resource-intensive nature, inherent subjectivity, and the risk of feedback loops that…

  3. DDS: DPU-optimized Disaggregated Storage [Extended Report] 

    August 28, 2024

    This extended report presents DDS, a novel disaggregated storage architecture enabled by emerging networking hardware, namely DPUs (Data Processing Units). DPUs can optimize the latency and CPU consumption of disaggregated storage servers. However, utilizing DPUs for DBMSs requires careful design of the network and storage…

  4. Decoding the AI Pen: Techniques and Challenges in Detecting AI-Generated Text 

    August 24, 2024 | Sara Abdali, Richard Anarfi, CJ Barberan, and Jia He

    Large Language Models (LLMs) have revolutionized the field of Natural Language Generation (NLG) by demonstrating an impressive ability to generate human-like text. However, their widespread usage introduces challenges that necessitate thoughtful examination, ethical scrutiny, and responsible practices. In this study, we delve into these…

  5. Interactive World Simulator

    August 23, 2024 | Tianyu He, Junliang Guo, and Jiang Bian

    Learning to simulate the visual world from large-scale videos. In-context learning for vision data has been underexplored compared with that in natural language. Previous works studied image in-context learning, urging models to generate a single image guided by demonstrations. In this project, we propose and…

  6. Extreme Meta-Classification for Large-Scale Zero-Shot Retrieval 

    August 23, 2024

    We develop accurate and efficient solutions for large-scale retrieval tasks where novel (zero-shot) items can arrive continuously at a rapid pace. Conventional Siamese-style approaches embed both queries and items through a small encoder and retrieve the items lying closest to the query. While this approach…

  7. ML for High-Performance Climate and Earth Virtualization Engines 

    August 22, 2024

    Speaker: Torsten Hoefler. Host: Karin Strauss. Machine learning presents a great opportunity for climate simulation and research. We will discuss some ideas from the Earth Virtualization Engines summit in Berlin and several research results ranging from ensemble prediction and bias correction of simulation output, extreme compression…

  8. What’s Your Story: Lex Story

    August 22, 2024 | Johannes Gehrke and Lex Story

    Model maker and fabricator Lex Story helps bring research to life through prototyping. He discusses his take on failure; the encouragement and advice that have supported his pursuit of art and science; and the sabbatical that might inspire his next career move.

  9. Controllable Financial Market Generation with Diffusion Guided Meta Agent 

    August 22, 2024

    Order flow modeling stands as the most fundamental and essential financial task, as orders embody the minimal unit within a financial market. However, current approaches often result in unsatisfactory fidelity in generating order flow, and their generation lacks controllability, thereby limiting their application scenario. In…