Publication EgoMemory: Memory-Augmented Personalized Retrieval for Long-Context Egocentric Video Yuanmin Tang, Jue Zhang, Xiaoting Qin, Jing Yu, Meikang Qiu, Gaopeng Gou, Gang Xiong, Qingwei Lin 林庆维, Saravan Rajmohan, Dongmei Zhang, Qi Wu ACL Findings | July 2026
Publication CI-Work: Benchmarking Contextual Integrity in Enterprise LLM Agents Wenjie Fu, Xiaoting Qin, Jue Zhang, Qingwei Lin 林庆维, Lukas Wutschitz, Robert Sim, Saravan Rajmohan, Dongmei Zhang ACL Industry Track | July 2026
Publication Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale Kunal Jain, A. Parayil, Ankur Mallick, Rujia Wang, Renee St. Amant, Chetan Bansal, Victor Ruehle, Saravan Rajmohan, Shashwat Jaiswal, Yogesh Simmhan, Anoop Kulkarni, Steve Kofsky ACM SIGMETRICS 2026 | June 2026
Publication iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception Sarthak Mehrotra, S. V. Rebbapragada, Mani Hemanth Reddy Bonthu, Vineeth N Balasubramanian 2026 Computer Vision and Pattern Recognition | June 2026
Publication Source Models Leak What They Shouldn’t: Unlearning Zero-Shot Transfer in Domain Adaptation Through Adversarial Optimization Arnav Devalapally, Poornima Jain, Kartik Srinivas, Vineeth N Balasubramanian 2026 Computer Vision and Pattern Recognition | June 2026
Publication Understanding Task Transfer in Vision-Language Models Bhuvan Sachdeva, Karan Uppal, Abhinav Java, Vineeth N Balasubramanian 2026 Computer Vision and Pattern Recognition | June 2026
Publication Foundation Model Priors Enhance Object Focus in Feature Space for Source-Free Object Detection S. V. Rebbapragada, Rishabh Lalla, Aveen Dayal, Tejal Kulkarni, A. Lalla, Vineeth N Balasubramanian, Muhammad Haris Khan 2026 Computer Vision and Pattern Recognition | June 2026
Publication Semantic Caching for Low-Cost LLM Serving: From Offline Learning to Online Adaptation Xutong Liu, Baran Atalar, Xiangxiang Dai, Jinhang Zuo, Siwei Wang, John C.S. Lui, Wei Chen, Carlee Joe-Wong IEEE International Conference on Computer Communications (INFOCOM) | May 2026
Publication DroidSpeak: Efficient Context Sharing for Multiple-LLM Inference Yuhan Liu, Yuyang Huang, Jiayi Yao, Zhuohan Gu, Kuntai Du, Hanchen Li, Yihua Cheng, Junchen Jiang, Shan Lu, Madan Musuvathi, Esha Choukse NSDI | May 2026
Publication Chow-Liu Ordering for Long-Context Reasoning in Chain-of-Agents Naman Gupta, Vaibhav Singh, Arun Iyer, Kirankumar Shiragur, Pratham Grover, Ramakrishna Bairi, Ritabrata Maiti, Sankarshan Damle, Shachee Mishra Gupta, Rishikesh Maurya, Vageesh D C International Conference on Learning Representations Workshop on Memory for LLM-Based Agentic Systems (MemAgents-ICLR) | April 2026