Microsoft Research Blog

Skeleton-of-Thought: Parallel decoding speeds up and improves LLM output 

November 17, 2023 | Xuefei Ning and Zinan Lin
This research was accepted by the 2024 International Conference on Learning Representations. Large language models (LLMs) such as LLaMA and OpenAI’s GPT-4 are revolutionizing technology. However, one of the common complaints about LLMs is their speed, or lack thereof. In many cases, it takes a…

Recent Posts

  1. Research Focus: November 8, 2023 on a gradient patterned background

    Research Focus: Week of November 8, 2023 

    November 8, 2023

    Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code/datasets, new hires and other milestones from across the research community at Microsoft. Generating both plausible and accurate full body avatar motion is essential for creating high quality immersive experiences in…

  2. FOCS 2023 paper: Toward developing faster algorithms for minimizing submodular functions

    Toward developing faster algorithms for minimizing submodular functions 

    November 7, 2023 | Haotian Jiang

    This research paper was presented at the 64th IEEE Symposium on Foundations of Computer Science (FOCS) 2023 (opens in new tab), a premier forum for the latest research in theoretical computer science. Submodular functions are versatile mathematical tools, finding diverse applications in real-world scenarios and…

  3. Project Silica paper at SOSP 2023

    Project Silica: Sustainable cloud archival storage in glass 

    October 26, 2023

    This research paper was presented at the 29th ACM Symposium on Operating Systems Principles (opens in new tab) (SOSP 2023), the premier forum for the theory and practice of computer systems software. For millennia, data has woven itself into every facet of our lives, from…

  4. Research Focus: October 25th

    Research Focus: Week of October 23, 2023 

    October 25, 2023

    In this issue: Kosmos-2.5: A Multimodal Literate Model; Can vine copulas explain complex relationships of weather variables; New system accelerates the adaptive training process; Structural inequalities and relational labor in the influencer industry.

  5. Research Focus October 11, 2023

    Research Focus: Week of October 9, 2023 

    October 11, 2023

    Research Focus: Principal researcher Lester Mackey recognized for pioneering statistical and ML techniques; Pareto frontiers in neural feature learning; structural inequality in the influencer industry; new research on cardinality estimation.

  6. ICCV 2023: SpaceEvo

    Efficient and hardware-friendly neural architecture search with SpaceEvo 

    October 6, 2023

    A persistent challenge in deep learning is optimizing neural network models for diverse hardware configurations, balancing performance and low latency. Learn how SpaceEvo automates hardware-aware neural architecture search to fine-tune DNN models for swift execution on diverse devices.

Explore More

  • Events & conferences

    Events & conferences 

    Meet our community of researchers, learn about exciting research topics, and grow your network

  • Podcasts

    Podcasts 

    Ongoing conversations at the cutting edge of research

  • Microsoft Research Forum

    Microsoft Research Forum 

    Join us for a continuous exchange of ideas about research in the era of general AI