Publication UI-Evol: Automatic Knowledge Evolving for Computer Use Agents Ziyun Zhang, Xinyi Liu, Xiaoyi Zhang, Jun Wang, Gang Chen, Yan Lu ArXiv | May 2025, Vol abs/2505.21964
Publication Text2Grad: Reinforcement Learning from Natural Language Feedback Hanyang Wang, Lu Wang, Chaoyun Zhang, Tianjun Mao, Si Qin, Qingwei Lin 林庆维, Saravan Rajmohan, Dongmei Zhang ICLR 2026 | May 2025
Publication Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Lai Wei, Yuting Li, Chen Wang, Yue Wang, Linghe Kong, Weiran Huang, Lichao Sun NeurIPS 2025 | May 2025
Publication VeriTrail: Closed-Domain Hallucination Detection with Traceability Dasha Metropolitansky, Jonathan Larson ICLR 2026 | May 2025
Publication Beyond Metrics: Evaluating LLMs’ Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios Millicent Ochieng, Varun Gumma, Sunayana Sitaram, Jindong Wang, Vishrav Chaudhary, Keshet Ronen, Kalika Bali, Jacki O'Neill Association for Computational Linguistics (ACL 2025) | May 2025
Publication Training Language Models to Generate Quality Code with Program Analysis Feedback Feng Yao, Zilong Wang, Liyuan Liu, Junxia Cui, Li Zhong, Xiaohan Fu, Haohui Mai, Vish Krishnan, Jianfeng Gao, Jingbo Shang NeurIPS 2025 | May 2025
Publication What Do Latent Action Models Actually Learn? Chuheng Zhang, Tim Pearce, Pushi Zhang, Kaixin Wang, Xiaoyu Chen, Wei Shen, Li Zhao, Jiang Bian NeurIPS 2025 | May 2025
Publication rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset Yifei Liu, Li Lyna Zhang, Yi Zhu, Bingcheng Dong, Xudong Zhou, Ning Shang, Fan Yang, Mao Yang NeurIPS 2025 | May 2025
Publication GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation Jie He, Jennifer Neville, Mengting Wan, Longqi Yang, Hui Liu, Xiaofeng Xu, Xia Song, Jeff Z. Pan, Pei Zhou Annual Meeting of the Association for Computational Linguistics (ACL 2025) | May 2025
Publication Token-Importance Guided Direct Preference Optimization Ning Yang, Hai Lin, Yibo Liu, Baoliang Tian, Guoqing Liu, Haijun Zhang ICLR 2026 | May 2025