Publication PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling Zefan Cai, Yichi Zhang, Bofei Gao, Yuliang Liu, Tianyu Liu, Keming Lu, Wayne Xiong, Yue Dong, Baobao Chang, Junjie Hu, Wen Xiao COLM 2025 | July 2025
Publication Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Kusha Sareen, Morgane M Moss, Alessandro Sordoni, Rishabh Agarwal, Arian Hosseini COLM 2025 | July 2025
Publication Scaling Laws of Synthetic Data for Language Models Zeyu Qin, Qingxiu Dong, Xingxing Zhang, Li Dong, Xiaolong Huang, Ziyi Yang, Mahmoud Khademi, Dongdong Zhang, Hany Hassan Awadalla, Yi R. Fung, Weizhu Chen, Minhao Cheng, Furu Wei COLM 2025 | July 2025
Publication Closed-loop optimization using machine learning for the accelerated design of sustainable cements incorporating algal biomatter Meng-Yen Lin, Kristen Severson, Paul Grandgeorge, Eleftheria Roumeli Matter | July 2025
Publication Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts Samin Yeasar Arnob, Zhan Su, Minseon Kim, Oleksiy Ostapenko, Riyasat Ohib, Esra'a Saleh, Doina Precup, Lucas Caccia, Alessandro Sordoni COLM 2025 | July 2025
Publication PoWER Never Corrupts: Tool-Agnostic Verification of Crash Consistency and Corruption Detection Hayley LeBlanc, Jay Lorch, Chris Hawblitzel, Cheng Huang, Yiheng Tao, Nickolai Zeldovich, Vijay Chidambaram USENIX Symposium on Operating Systems Design and Implementation (OSDI) | July 2025 Distinguished Artifact Award Project
Publication SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Zichong Li, Chen Liang, Zixuan Zhang, Ilgee Hong, Young Jin Kim, Weizhu Chen, Tuo Zhao COLM 2025 | July 2025
Publication Rethinking Safety in LLM Fine-tuning: An Optimization Perspective Minseon Kim, Jin Myung Kwak, Lama Alssum, Bernard Ghanem, Philip H. S. Torr, David Krueger, Fazl Barez, Adel Bibi COLM 2025 | July 2025
Publication WaferLLM: Large Language Model Inference at Wafer Scale Congjie He, Yeqi Huang, Pei Mu, Ziming Miao, Jilong Xue, Lingxiao Ma, Fan Yang, Luo Mai OSDI 2025 | July 2025
Publication Training Plug-and-Play Knowledge Modules with Deep Context Distillation Lucas Caccia, Alan Ansell, E. Ponti, Ivan Vuli'c, Alessandro Sordoni COLM 2025 | July 2025