In the news | Business Wire
PKSHA Technology Inc. has developed one of the first Japanese-English Large Language Models (LLM) using Retentive Network (RetNet) in collaboration with Microsoft Japan Co., Ltd. Through this LLM development, PKSHA will further enhance the practicality of generative AI in the…
“要干一票大的”,这是林郅琦初入微软亚洲研究院实习时定下的目标。从 2018 年到 2024 年,通过微软-中科大联合培养博士项目,林郅琦在研究院度过了宝贵的六年时光,也收获了科研上的全方位成长。 除了产出有影响力的科研成果——以第一作者身份在系统领域顶级学术会议 OSDI 2024 和 HPCA 2024 上发表两篇论文,林郅琦还在 mentor 的指导下探索如何在大模型时代发挥系统研究的奠基性...
In the news | The Times UK
For years governments, hospitals and families have had to use frail magnetic storage for their most important data. Now, scientists have an alternative — that lasts for ever...
In the news | PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
PIM-DL is the first deep learning framework specifically designed for off-the-shelf processing-in-memory (PIM) systems, capable of offloading most computations in neural networks. Its goal is to surmount the computational limitations of PIM hardware by replacing traditional compute-heavy matrix multiplication operations…
In the news | Devex Book Club
In this episode of the Devex Book Club podcast, Juan M. Lavista Ferres discusses the many ways that AI can have a positive impact on the world, from identifying health trends to tracking beluga whales.
| Rafah Hosn and Gretchen Huizinga
Energized by disruption, partner group product manager Rafah Hosn is helping to drive scientific advancement in AI for Microsoft. She talks about the mindset needed to work at the frontiers of AI and how the research-to-product pipeline is changing in…
编者按:检索增强生成(RAG)技术因在减少生成幻觉和虚构信息方面的显著效果,以及对知识及时更新能力的改善,正逐渐成为大语言模型系统的主流架构之一。随着 RAG 技术的广泛应用,其核心组件——向量数据库,也开始受到越来越多的关注,成为大模型中不可或缺的外挂知识库。 然而,向量数据库与传统关系型数据库有着显著区别,这给数据的统一管理、查询和更新带来了诸多不便。为此,微软亚洲研究院开发了 VBase 复...
编者按:欢迎阅读“科研上新”栏目!“科研上新”汇聚了微软亚洲研究院最新的创新成果与科研动态。在这里,你可以快速浏览研究院的亮点资讯,保持对前沿领域的敏锐嗅觉,同时也能找到先进实用的开源工具。 论文链接:https://www.microsoft.com/en-us/research/publication/an-empirical-study-on-low-gpu-utilization-of-d...
| Tobias Schnabel and Jennifer Neville
SAMMO optimizes prompts for LLMs by leveraging their structure to guide optimization. This minimizes the time and effort needed to find performant prompts on a variety of tasks.