Publication Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky ACM SIGMETRICS 2026 | June 2026
Publication Semantic Caching for Low-Cost LLM Serving: From Offline Learning to Online Adaptation Xutong Liu, Baran Atalar, Xiangxiang Dai, Jinhang Zuo, Siwei Wang, John C.S. Lui, Wei Chen, Carlee Joe-Wong IEEE International Conference on Computer Communications (INFOCOM) | May 2026
Publication DroidSpeak: Efficient Context Sharing for Multiple-LLM Inference Yuhan Liu, Yuyang Huang, Jiayi Yao, Zhuohan Gu, Kuntai Du, Hanchen Li, Yihua Cheng, Junchen Jiang, Shan Lu, Madan Musuvathi, Esha Choukse NSDI | May 2026
Publication EgoBrain: Synergizing Minds and Eyes For Human Action Understanding Nie Lin, Yansen Wang, Dongqi Han, Weibang Jiang, Jingyuan Li, Ryosuke Furuta, Yoichi Sato, Dongsheng Li 2026 International Conference on Learning Representations | April 2026
Publication Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Zhihe Yang, Xufang Luo, Zilong Wang, Dongqi Han, Zhiyuan He, Dongsheng Li, Yunjian Xu ICLR 2026 | April 2026
Publication Algorithm Generation via Creative Ideation Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, Francis Y. Yan ICLR (International Conference on Learning Representations) | April 2026
Publication VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL Kyoungjun Park, Yifan Yang, Juheon Yi, Shicheng Zheng, Yifei Shen, Dongqi Han, Caihua Shan, Muhammad Muaz, Lili Qiu 2026 International Conference on Learning Representations | April 2026
Publication Parallel Sampling from Masked Diffusion Models via Conditional Independence Testing Iskander Azangulov, Teodora Pandeva, Niranjani Prasad, Javier Zazo, Sushrut Karmalkar April 2026
Publication Forward-Learned Discrete Diffusion: Learning how to noise to denoise faster Grigory Bartosh, Teodora Pandeva, Sushrut Karmalkar, Javier Zazo ICLR 2026 | April 2026
Publication Learning to Generate Unit Test via Adversarial Reinforcement Learning Dongjun Lee, Changho Hwang, Kimin Lee 2026 International Conference on Learning Representations | April 2026