Publication Making Every Frame Matter: Continuous Video Understanding for Large Models via Adaptive State Modeling Hao Wu, Donglin Bai, Shiqi Jiang, Qianxi Zhang, Yifan Yang, Ting Cao, Fengyuan Xu October 2024
Publication SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs Yizhao Gao, Zhichen Zeng, Dayou Du, Shijie Cao, Hayden Kwok-Hay So, Ting Cao, Fan Yang, Mao Yang October 2024
Publication Towards Graph Foundation Models: Training on Knowledge Graphs Enables Transferability to General Graphs Kai Wang, Siqiang Luo, Caihua Shan, Yifei Shen NeurIPS 2025 | October 2024
Publication Differential Transformer Tianzhu Ye, Li Dong, Yuqing Xia, Yutao Sun, Yi Zhu, Gao Huang, Furu Wei MSR-TR-2024-42 | October 2024 Published by Microsoft
Publication WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin 2024 Meeting of the Association for Computational Linguistics | October 2024
Publication Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation Shaonan Wu, Shuai Lu, Yeyun Gong, Nan Duan, Ping Wei October 2024
Publication IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI Xiaoyu Chen, Junliang Guo, Tianyu He, Chuheng Zhang, Pushi Zhang, Derek Yang, Li Zhao, Jiang Bian October 2024
Publication Multimodal Large Language Models Make Text-to-Image Generative Models Align Better Xun Wu, Shaohan Huang, Furu Wei October 2024
Publication Scaling the Codebook Size of VQ-GAN to 100,000 with a Utilization Rate of 99% Fangyun Wei, Dong Chen October 2024