Publication PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning Yizhen Zhang, Yang Ding, Shuoshuo Zhang, Xinchen Zhang, Haoling Li, Zhong-zhi Li, Peijie Wang, Jie Wu, Lei Ji, Yelong Shen, Yujiu Yang, Yeyun Gong NeurIPS 2025 | June 2025
Publication Direct Reasoning Optimization: Constrained RL with Token-Level Dense Reward and Rubric-Gated Constraints for Open-ended Tasks Yifei Xu, Tusher Chakraborty, Srinagesh Sharma, Leonardo Nunes, Swati Sharma, Kate Drakos Demopulos, Emre Kiciman, Songwu Lu, Ranveer Chandra Arxiv | June 2025
Publication Self-Enhancing Video Data Management System for Compositional Events with Large Language Models Enhao Zhang, Nicole Sullivan, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska Proceedings of the ACM on Management of Data | June 2025, Vol 3: pp. 1-29
Publication Principal Type Inference under a Prefix Daan Leijen, Wenjia Ye PLDI'25 | June 2025 Distinguished Paper "A Fresh Look at Static Overloading". See also the accompanying technical report. Project
Publication Screen Reader Users in the Vibe Coding Era: Adaptation, Empowerment, and New Accessibility Landscape Nan Chen, Luna K. Qiu, Arran Zeyu Wang, Zilong Wang, Yuqing Yang ArXiv | June 2025, Vol abs/2506.13270
Publication Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study Zhengyu Hu, Jianxun Lian, Zheyuan Xiao, Seraphina Zhang, Tianfu Wang, Nicholas Jing Yuan, Xing Xie, Hui Xiong NeurIPS 2025 | June 2025
Publication The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason Shanchao Liang, Spandan Garg, Roshanak Zilouchian Moghaddam ArXiv | June 2025, Vol abs/2506.12286
Publication Implicit Language Models are RNNs: Balancing Parallelization and Expressivity Mark Schöne, Babak Rahmani, Heiner Kremer, Fabian Falck, Hitesh Ballani, Jannes Gladrow ICML 2025 | June 2025 Github Project
Publication From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review Yaohui Zhang, Haijing Zhang, Wenlong Ji, Tianyu Hua, Nick Haber, Hancheng Cao, Weixin Liang NeurIPS 2025 | June 2025
Publication MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning Yuxuan Luo, Yuhui Yuan, Junwen Chen, Haonan Cai, Ziyi Yue, Yuwei Yang, Fatima Zohra Daha, Ji Li, Zhouhui Lian NeurIPS 2025 | June 2025