Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky ACM SIGMETRICS 2026 | June 2026
iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception Sarthak Mehrotra, S. V. Rebbapragada, Mani Hemanth Reddy Bonthu, Vineeth N Balasubramanian 2026 Computer Vision and Pattern Recognition | June 2026
Source Models Leak What They Shouldn’t: Unlearning Zero-Shot Transfer in Domain Adaptation Through Adversarial Optimization Arnav Devalapally, Poornima Jain, Kartik Srinivas, Vineeth N Balasubramanian 2026 Computer Vision and Pattern Recognition | June 2026
Understanding Task Transfer in Vision-Language Models Bhuvan Sachdeva, Karan Uppal, Abhinav Java, Vineeth N Balasubramanian 2026 Computer Vision and Pattern Recognition | June 2026
Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection S. V. Rebbapragada, Rishabh Lalla, Aveen Dayal, Tejal Kulkarni, A. Lalla, Vineeth N Balasubramanian, Muhammad Haris Khan 2026 Computer Vision and Pattern Recognition | June 2026
Semantic Caching for Low-Cost LLM Serving: From Offline Learning to Online Adaptation Xutong Liu, Baran Atalar, Xiangxiang Dai, Jinhang Zuo, Siwei Wang, John C.S. Lui, Wei Chen, Carlee Joe-Wong IEEE International Conference on Computer Communications (INFOCOM) | May 2026
DroidSpeak: Efficient Context Sharing for Multiple-LLM Inference Yuhan Liu, Yuyang Huang, Jiayi Yao, Zhuohan Gu, Kuntai Du, Hanchen Li, Yihua Cheng, Junchen Jiang, Shan Lu, Madan Musuvathi, Esha Choukse NSDI | May 2026
Chow-Liu Ordering for Long-Context Reasoning in Chain-of-Agents Naman Gupta, Vaibhav Singh, Arun Iyer, Kirankumar Shiragur, Pratham Grover, Ramakrishna Bairi, Ritabrata Maiti, Sankarshan Damle, Shachee Mishra Gupta, Rishikesh Maurya, Vageesh D C International Conference on Learning Representations Workshop on Memory for LLM-Based Agentic Systems | April 2026
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Zhihe Yang, Xufang Luo, Zilong Wang, Dongqi Han, Zhiyuan He, Dongsheng Li, Yunjian Xu ICLR 2026 | April 2026
EgoBrain: Synergizing Minds and Eyes For Human Action Understanding Nie Lin, Yansen Wang, Dongqi Han, Weibang Jiang, Jingyuan Li, Ryosuke Furuta, Yoichi Sato, Dongsheng Li 2026 International Conference on Learning Representations | April 2026