Microsoft at ASPLOS 2024: Advancing hardware and software for high-scale, secure, and efficient modern applications
Publication: Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale. Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky. ACM SIGMETRICS 2026 | June 2026
Publication: DroidSpeak: Efficient Context Sharing for Multiple-LLM Inference. Yuhan Liu, Yuyang Huang, Jiayi Yao, Zhuohan Gu, Kuntai Du, Hanchen Li, Yihua Cheng, Junchen Jiang, Shan Lu, Madan Musuvathi, Esha Choukse. NSDI | May 2026
Publication: Harvesting Spare CPU Resources in Container Systems. Adam Hall, Anirudh Sarma, Esha Choukse, Kishore Ramachandran, Sameh Elnikety. NSDI | May 2026
Publication: Concord: Learning Network Configuration Contracts. Ryan Beckett, Francis Y. Yan, Raghunadha Reddy Pocha, Vineesh V. Raj, Ayyub Shaik, Siva Kesava Reddy Kakarla. 2026 European Conference on Computer Systems | April 2026
Publication: Algorithm Generation via Creative Ideation. Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, Francis Y. Yan. ICLR (International Conference on Learning Representations) | April 2026
Publication: VeriStruct: AI-assisted Automated Verification of Data-Structure Modules in Verus. Chuyue Sun, Yican Sun, Ethan Zhang, Daneshvar Amrollahi, Shuvendu Lahiri, Shan Lu, David Dill, Clark Barrett. International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS) | April 2026
Publication: NetArena: Dynamic Benchmarks for AI Agents in Network Automation. Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu. ICLR 2026 | April 2026. Ranked 1st on the "Coding Agent" benchmark in Berkeley's AgentX competition.
Publication: QoServe: Breaking the Silos of LLM Inference Serving. Kanishk Goel, Jayashree Mohan, Nipun Kwatra, Ravi Shreyas Anupindi, Ramachandran Ramjee. Architectural Support for Programming Languages and Operating Systems (ASPLOS) 2026 | March 2026
Publication: MSCCL++: Rethinking GPU Communication Abstractions for AI Inference. Changho Hwang, Peng Cheng, Roshan Dathathri, Abhinav Jangda, Saeed Maleki, Madan Musuvathi, Olli Saarikivi, Aashaka Shah, Ziyue Yang, Binyang Li, Caio Rocha, Qinghua Zhou, Mahdieh Ghazimirsaeed, Sreevatsa Anantharamu, Jithin Jose. ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) | March 2026
Microsoft Research Blog: Project Silica’s advances in glass storage technology. February 18, 2026 | Richard Black. Project Silica introduces new techn…