Microsoft Research Blog

English

  1. CodePlan: Repository-level Coding using LLMs and Planning 

    July 12, 2024

    Software engineering activities such as package migration, fixing error reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub…

  2. Intelligence Toolkit 

    July 11, 2024 | Dayenne Souza and Darren Edge

    The Intelligence Toolkit is a suite of interactive workflows for creating AI intelligence reports from real-world data sources. The toolkit is designed to help users identify patterns, answers, relationships, and risks within complex datasets, with generative AI (OpenAI GPT models) used to create reports on…

  3. Autoregressive Speech Synthesis without Vector Quantization 

    July 11, 2024

    We present MELLE, a novel continuous-valued tokens based language modeling approach for text to speech synthesis (TTS). MELLE autoregressively generates continuous mel-spectrogram frames directly from text condition, bypassing the need for vector quantization, which are originally designed for audio compression and sacrifice fidelity compared to…

  4. Accuracy is Not All You Need 

    July 11, 2024 | Abhinav Dutta, Sanjeev Krishnan, Nipun Kwatra, and Ramachandran Ramjee

    When Large Language Models (LLMs) are compressed using techniques such as quantization, the predominant way to demonstrate the validity of such techniques is by measuring the model's accuracy on various benchmarks.If the accuracies of the baseline model and the compressed model are close, it is…

  5. VeriSMo: A Verified Security Module for Confidential VMs 

    July 11, 2024

    Hardware vendors have introduced confidential VM architectures (e.g., AMD SEV-SNP, Intel TDX and Arm CCA) in recent years. They eliminate the trust in the hypervisor and lead to the need for security modules such as AMD Secure VMService Module (SVSM). These security modules aim to…

  6. Autoregressive Video Models 

    July 10, 2024 | Junliang Guo, Tianyu He, Li Zhao, and Jiang Bian

    Driving large video models with next token prediction In-context learning for vision data has been underexplored compared with that in natural language. Previous works studied image in-context learning, urging models to generate a single image guided by demonstrations. In this project, we propose and study…