Microsoft Research Blog

English

  1. MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model 

    September 3, 2024

    Generative models aim to simulate realistic effects of various actions across different contexts, from text generation to visual effects. Despite efforts to build real-world simulators, leveraging generative models for virtual worlds, like financial markets, remains underexplored. In financial markets, generative models can simulate market effects…

  2. Highly Accurate Real-space Electron Densities with Neural Networks 

    September 2, 2024

    Variational ab initio methods in quantum chemistry stand out among other methods in providing direct access to the wave function. This allows, in principle, straightforward extraction of any other observable of interest, besides the energy, but, in practice, this extraction is often technically difficult and computationally…

  3. The Paradox of Spreadsheet Self-Efficacy: Social Incentives for Informal Knowledge Sharing in End-User Programming 

    September 1, 2024 | Advait Sarkar, Qing (Nancy) Xia, Duncan P. Brumby, and Anna Cox

    Informal Knowledge Sharing (KS) is vital for enduser programmers to gain expertise. To better understand how personal (self-efficacy), social (reputational gains, trust between colleagues), and software-related (codification effort) variables influence spreadsheet KS intention, we conducted a multiple regressions analysis based on survey data from spreadsheet…

  4. Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely 

    September 1, 2024

    Large language models (LLMs) augmented with external data have demonstrated remarkable capabilities in completing real-world tasks. Techniques for integrating external data into LLMs, such as Retrieval-Augmented Generation (RAG) and fine-tuning, are gaining increasing attention and widespread application. Nonetheless, the effective deployment of data-augmented LLMs across…

  5. Datacenter power and energy management: past, present, and future 

    September 1, 2024 | Ricardo Bianchini, Christian Belady, and Anand Sivasubramaniam

    This article overviews some of the key past developments in cloud datacenter power and energy management, where we are today, and what the future could be. This topic is gaining enormous, renewed interest in the context of the conflicting needs of the AI revolution and…

  6. COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning 

    September 1, 2024

    We present a cost-effective method to integrate speech into a large language model (LLM), resulting in a Contextual Speech Model with Instruction-following/in-context-learning Capabilities (COSMIC) multi-modal LLM. Using GPT-3.5, we generate Speech Comprehension Test Question-Answer (SQA) pairs from speech transcriptions for supervised instruction tuning. With under…