Publication Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts Samin Yeasar Arnob, Zhan Su, Minseon Kim, Oleksiy Ostapenko, Riyasat Ohib, Esra'a Saleh, Doina Precup, Lucas Caccia, Alessandro Sordoni COLM 2025 | July 2025
Publication SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Zichong Li, Chen Liang, Zixuan Zhang, Ilgee Hong, Young Jin Kim, Weizhu Chen, Tuo Zhao COLM 2025 | July 2025
Publication Rethinking Safety in LLM Fine-tuning: An Optimization Perspective Minseon Kim, Jin Myung Kwak, Lama Alssum, Bernard Ghanem, Philip H. S. Torr, David Krueger, Fazl Barez, Adel Bibi COLM 2025 | July 2025
Publication WaferLLM: Large Language Model Inference at Wafer Scale Congjie He, Yeqi Huang, Pei Mu, Ziming Miao, Jilong Xue, Lingxiao Ma, Fan Yang, Luo Mai OSDI 2025 | July 2025
Publication Warbler: Speculative Distributed Transactions with Geo-Replication Weihai Shen, Yang Cui, Siddhartha Sen, Sebastian Angel, Shuai Mu OSDI 2025 | July 2025
Publication Kamino: Efficient VM Allocation at Scale with Latency-Driven Cache-Aware Scheduling David Domingo, Hugo Barbalho, Marco Molinaro, Kuan Liu, Abhisek Pan, David Dion, Thomas Moscibroda, Sudarsun Kannan, Ishai Menache OSDI 2025 | July 2025
Publication Training Plug-and-Play Knowledge Modules with Deep Context Distillation Lucas Caccia, Alan Ansell, E. Ponti, Ivan Vulić, Alessandro Sordoni COLM 2025 | July 2025
Publication SecurityLingua: Efficient Defense of LLM Jailbreak Attacks via Security-Aware Prompt Compression Yucheng Li, Surin Ahn, Huiqiang Jiang, Amir H. Abdi, Yuqing Yang, Lili Qiu COLM 2025 | July 2025
Publication REFA: Reference Free Alignment for multi-preference optimization Taneesh Gupta, Rahul Madhavan, Xuchao Zhang, Chetan Bansal, Saravan Rajmohan COLM 2025 | July 2025
Publication Artificial Intelligence and other Speculative Metaphors Mark Blythe, Siân Lindley, Dave Murray-Rust Proceedings of the 2025 ACM Designing Interactive Systems Conference | July 2025, pp. 347-356