Papers and Publications
Semantic Caching for Low-Cost LLM Serving: From Offline Learning to Online Adaptation. Xutong Liu, Baran Atalar, Xiangxiang Dai, Jinhang Zuo, Siwei Wang, John C.S. Lui, Wei Chen, Carlee Joe-Wong. IEEE International Conference on Computer Communications (INFOCOM) | May 2026
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs. Zhihe Yang, Xufang Luo, Zilong Wang, Dongqi Han, Zhiyuan He, Dongsheng Li, Yunjian Xu. ICLR 2026 | April 2026
Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective. Siwei Wang, Yifei Shen, Haoran Sun, Shi Feng, Shang-Hua Teng, Li Dong, Yaru Hao, Wei Chen. Proceedings of the 14th International Conference on Learning Representations (ICLR) | April 2026
Combinatorial Rising Bandits. Seockbean Song, Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul Ok. Proceedings of the 14th International Conference on Learning Representations (ICLR) | April 2026
Lipschitz Bandits with Stochastic Delayed Feedback. Zhongxuan Liu, Yue Kang, Thomas C. M. Lee. 2026 International Conference on Learning Representations | April 2026
Beyond Correctness: Learning Robust Reasoning via Transfer. Hyunseok Lee, Soheil Abbasloo, Jihoon Tack, Jinwoo Shin. February 2026
Welfarist Formulations for Diverse Similarity Search. Siddharth Barman, Nirjhar Das, Shivam Gupta, Kirankumar Shiragur. ICLR 2026 | February 2026
Composable Coresets for Constrained Determinant Maximization and Beyond. Sepideh Mahabadi, Thuy-Duong Vuong. AISTATS | January 2026 | Spotlight
Sublinear Metric Steiner Forest via Maximal Independent Set. Sepideh Mahabadi, Mohammad Roghani, Jakub Tarnawski, Ali Vakilian. Symposium on Discrete Algorithms (SODA) | January 2026