Publication Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky ACM SIGMETRICS 2026 | June 2026 Project
Publication DroidSpeak: Efficient Context Sharing for Multiple-LLM Inference Yuhan Liu, Yuyang Huang, Jiayi Yao, Zhuohan Gu, Kuntai Du, Hanchen Li, Yihua Cheng, Junchen Jiang, Shan Lu, Madan Musuvathi, Esha Choukse NSDI | May 2026 Project
Publication Multimodal AI generates virtual population for tumor microenvironment modeling Jeya Maria Jose Valanarasu, Hanwen Xu, Naoto Usuyama, Chanwoo Kim, Cliff Wong, Peniel Argaw, Racheli Ben Shimol, Angela Crabtree, Kevin Matlock, Alexandra Q. Bartlett, Jaspreet Bagga, Yu Gu, Sheng Zhang, Tristan Naumann, Bernard A. Fox, Bill Wright, Ari Robicsek, Brian Piening, Carlo Bifulco, Sheng Wang, Hoifung Poon Cell | December 2025
Publication Astral Space: Convex Analysis at Infinity Miro Dudík, Robert E. Schapire, Matus Telgarsky December 2025 Final pre-publication draft of book to be published by Princeton University Press in 2026.
Publication SimSort: A Data-Driven Framework for Spike Sorting by Large-Scale Electrophysiology Simulation Yimu Zhang, Dongqi Han, Yansen Wang, Zhenning Lv, Yu Gu, Dongsheng Li NeurIPS 2025 | December 2025
Publication Improved Algorithms for Fair Matroid Submodular Maximization Sepideh Mahabadi, Sherry Sarkar, Jakub Tarnawski NeurIPS 2025 | December 2025
Publication Lost in Transmission: When and Why LLMs Fail to Reason Globally Tobias Schnabel, Kiran Tomlinson, Adith Swaminathan, Jennifer Neville NeurIPS 2025 | December 2025 NeurIPS Spotlight GitHub
Publication Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models Chenrui Cao, Liangcheng Song, Zenan Li, Xinyi Le, Xian Zhang, Hui Xue, Fan Yang NeurIPS 2025 | December 2025
Publication MeshAgent: Enabling Reliable Network Management with Large Language Models Yajie Zhou, Kevin Hsieh, Sathiya Kumaran Mani, Srikanth Kandula, Zaoxing Liu SIGMETRICS’26 | December 2025
Publication Distilled Decoding 2: One-step Sampling of Image Auto-regressive Models with Conditional Score Distillation Enshu Liu, Qian Chen, Xuefei Ning, Shengen Yan, Guohao Dai, Zinan Lin, Yu Wang NeurIPS 2025 | December 2025