In the news | Business Wire

PKSHA harnesses RetNet to develop the first Japanese-English LLM

April 29, 2024

PKSHA Technology Inc. has developed one of the first Japanese-English Large Language Models (LLM) using Retentive Network (RetNet) in collaboration with Microsoft Japan Co., Ltd. Through this LLM development, PKSHA will further enhance the practicality of generative AI in the…

Articles

实习派 | 林郅琦：找到真正痛点，在大模型时代发挥系统研究的奠基性价值

April 27, 2024

“要干一票大的”，这是林郅琦初入微软亚洲研究院实习时定下的目标。从 2018 年到 2024 年，通过微软-中科大联合培养博士项目，林郅琦在研究院度过了宝贵的六年时光，也收获了科研上的全方位成长。除了产出有影响力的科研成果——以第一作者身份在系统领域顶级学术会议 OSDI 2024 和 HPCA 2024 上发表两篇论文，林郅琦还在 mentor 的指导下探索如何在大模型时代发挥系统研究的奠基性...

In the news | The Times UK

The tiny glass blocks that can preserve your data for centuries

April 27, 2024

For years governments, hospitals and families have had to use frail magnetic storage for their most important data. Now, scientists have an alternative — that lasts for ever...

In the news | PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization

Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications

April 27, 2024

PIM-DL is the first deep learning framework specifically designed for off-the-shelf processing-in-memory (PIM) systems, capable of offloading most computations in neural networks. Its goal is to surmount the computational limitations of PIM hardware by replacing traditional compute-heavy matrix multiplication operations…

In the news | Devex Book Club

Juan M. Lavista Ferres on the power of AI for good

April 26, 2024

In this episode of the Devex Book Club podcast, Juan M. Lavista Ferres discusses the many ways that AI can have a positive impact on the world, from identifying health trends to tracking beluga whales.

Microsoft Research Podcast

Ideas: Exploring AI frontiers with Rafah Hosn

April 25, 2024 | Rafah Hosn and Gretchen Huizinga

Energized by disruption, partner group product manager Rafah Hosn is helping to drive scientific advancement in AI for Microsoft. She talks about the mindset needed to work at the frontiers of AI and how the research-to-product pipeline is changing in…

Articles

统一化数据库：为大语言模型垂域应用奠定基础

April 24, 2024

编者按：检索增强生成（RAG）技术因在减少生成幻觉和虚构信息方面的显著效果，以及对知识及时更新能力的改善，正逐渐成为大语言模型系统的主流架构之一。随着 RAG 技术的广泛应用，其核心组件——向量数据库，也开始受到越来越多的关注，成为大模型中不可或缺的外挂知识库。然而，向量数据库与传统关系型数据库有着显著区别，这给数据的统一管理、查询和更新带来了诸多不便。为此，微软亚洲研究院开发了 VBase 复...

Articles

低GPU利用率的实证研究；可解决数学问题的数据合成新范式；大规模合成数学推理的指令微调数据；大模型改进推荐系统

April 18, 2024

编者按：欢迎阅读“科研上新”栏目！“科研上新”汇聚了微软亚洲研究院最新的创新成果与科研动态。在这里，你可以快速浏览研究院的亮点资讯，保持对前沿领域的敏锐嗅觉，同时也能找到先进实用的开源工具。论文链接：https://www.microsoft.com/en-us/research/publication/an-empirical-study-on-low-gpu-utilization-of-d...

SAMMO optimizer diagram showing progression from starting prompt to optimized prompt.

Microsoft Research Blog

SAMMO: A general-purpose framework for prompt optimization

April 18, 2024 | Tobias Schnabel and Jennifer Neville

SAMMO optimizes prompts for LLMs by leveraging their structure to guide optimization. This minimizes the time and effort needed to find performant prompts on a variety of tasks.