Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications
Publication: Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale. Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky. ACM Sigmetrics 2026 | June 2026
Publication: DroidSpeak: Efficient Context Sharing for Multiple-LLM Inference. Yuhan Liu, Yuyang Huang, Jiayi Yao, Zhuohan Gu, Kuntai Du, Hanchen Li, Yihua Cheng, Junchen Jiang, Shan Lu, Madan Musuvathi, Esha Choukse. NSDI | May 2026
Publication: Harvesting Spare CPU Resources in Container Systems. Adam Hall, Anirudh Sarma, Esha Choukse, Kishore Ramachandran, Sameh Elnikety. NSDI | May 2026
Publication: Concord: Learning Network Configuration Contracts. Ryan Beckett, Francis Y. Yan, Raghunadha Reddy Pocha, Vineesh V. Raj, Ayyub Shaik, Siva Kesava Reddy Kakarla. 2026 European Conference on Computer Systems | April 2026
Publication: Algorithm Generation via Creative Ideation. Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, Francis Y. Yan. ICLR (International Conference on Learning Representations) | April 2026
Publication: VeriStruct: AI-assisted Automated Verification of Data-Structure Modules in Verus. Chuyue Sun, Yican Sun, Ethan Zhang, Daneshvar Amrollahi, Shuvendu Lahiri, Shan Lu, David Dill, Clark Barrett. International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS) | April 2026
Publication: NetArena: Dynamic Benchmarks for AI Agents in Network Automation. Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu. ICLR 2026 | April 2026. Ranked 1st on the "Coding Agent" benchmark in Berkeley's AgentX competition.
Publication: QoServe: Breaking the Silos of LLM Inference Serving. Kanishk Goel, Jayashree Mohan, Nipun Kwatra, Ravi Shreyas Anupindi, Ramachandran Ramjee. Architectural Support for Programming Languages and Operating Systems (ASPLOS) 2026 | March 2026
Publication: MSCCL++: Rethinking GPU Communication Abstractions for AI Inference. Changho Hwang, Peng Cheng, Roshan Dathathri, Abhinav Jangda, Saeed Maleki, Madan Musuvathi, Olli Saarikivi, Aashaka Shah, Ziyue Yang, Binyang Li, Caio Rocha, Qinghua Zhou, Mahdieh Ghazimirsaeed, Sreevatsa Anantharamu, Jithin Jose. ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) | March 2026
Microsoft Research Blog: Project Silica’s advances in glass storage technology. February 18, 2026 | Richard Black. Project Silica introduces new techn…