Publication SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou USENIX ATC | July 2024 Best Paper Award
Publication Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang ISCA 2024 | July 2024 Microsoft Research Focus https://www.microsoft.com/en-us/research/blog/research-focus-week-of-july-15-2024/
Publication VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei June 2024 Project
Publication Direct Preference Knowledge Distillation for Large Language Models Yixing Li, Yuxian Gu, Li Dong, Dequan Wang, Yu Cheng, Furu Wei June 2024
Publication Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions Minghan Li, Heng Li, Zhi-Qi Cheng, Yifei Dong, Yuxuan Zhou, Jun-Yan He, Qi Dai, Teruko Mitamura, Alexander G. Hauptmann NeurIPS 2024 | June 2024
Publication T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge Jianyu Wei, Shijie Cao, Ting Cao, Lingxiao Ma, Lei Wang, Yanyong Zhang, Mao Yang June 2024 Project
Publication Instruction Pre-Training: Language Models are Supervised Multitask Learners Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei June 2024
Publication SeD: Semantic-Aware Discriminator for Image Super-Resolution Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, Zhibo Chen The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 | June 2024
Publication Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms Miaosen Zhang, Yixuan Wei, Zhen Xing, Yifei Ma, Zuxuan Wu, Ji Li, Zheng Zhang, Qi Dai, Chong Luo, Xin Geng, Baining Guo NeurIPS 2024 | June 2024
Publication VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers Sanyuan Chen, Shujie Liu, Long Zhou, Yanqing Liu, Xu Tan, Jinyu Li, Sheng Zhao, Yao Qian, Furu Wei June 2024 Project