Publication Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely Siyun Zhao, Yuqing Yang, Zilong Wang, Zhiyuan He, Luna K. Qiu, Lili Qiu September 2024
Publication Uncover Nested Data Parallelism and Data Reuse in DNN Computation with FractalTensor Ying Cao, Fan Yang, Mao Yang September 2024
Publication Compositional 3D-aware Video Generation with LLM Director Hanxin Zhu, Tianyu He, Anni Tang, Junliang Guo, Zhibo Chen, Jiang Bian NeurIPS 2024 | August 2024
Publication Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Zhenting Qi, Mingyuan Ma, Jiahang Xu, Li Lyna Zhang, Fan Yang, Mao Yang ICLR 2025 | August 2024
Publication LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration Zhiwen Mo, Lei Wang, Jianyu Wei, Zhichen Zeng, Shijie Cao, Lingxiao Ma, Naifeng Jing, Ting Cao, Jilong Xue, Fan Yang, Mao Yang August 2024 Project
Publication VulLibGen: Generating Names of Vulnerability-Affected Packages via a Large Language Model Tianyu Chen, Lin Li, Liuchuan Zhu, Zongyang Li, Xueqing Liu, Guangtai Liang, Qianxiang Wang, Tao Xie ACL 2024 | August 2024
Publication Scaling Deep Learning Computation over the Inter-Core Connected Intelligence Processor with T10 Yiqi Liu, Yuqi Xue, Yu Cheng, Lingxiao Ma, Ziming Miao, Jilong Xue, Jian Huang SOSP 2024 | August 2024
Publication Uncovering Milestone Papers: A Network Diffusion and Game Theory Approach Wei Zhang, Juyang Cao, Manuel Sebastian Mariani, Zhen-Zhen Wang, Mingyang Zhou, Wei Chen, Hao Liao Journal of Informetrics | August 2024, Vol 18(3)
Publication LordNet: An efficient neural network for learning to solve parametric partial differential equations without simulated data Xinquan Huang, Wenlei Shi, Xiaotian Gao, Xinran wei, Jia Zhang, Jiang Bian, Mao Yang, Tie-Yan Liu Neural Networks | August 2024
Publication Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Hongyu Wang, Shuming Ma, Ruiping Wang, Furu Wei July 2024