Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition Yuang Li, Yu Wu, Jinyu Li, Shujie Liu Workshop of Automatic Speech Recognition and Understanding | December 2023
Publication Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li Workshop of Automatic Speech Recognition and Understanding | December 2023
Publication Multi Transcription-Style Speech Transcription Using Attention-based Encoder-decoder Model Yan Huang, Piyush Behre, Guoli Ye, Shawn Chang, Yifan Gong ASRU 2023 | December 2023
Publication Responsible AI Considerations in Text Summarization Research: A Review of Current Practices Yu Lu Liu, Meng Cao, Su Lin Blodgett, Jackie Chi Kit Cheung, Alexandra Olteanu, Adam Trischler Findings of EMNLP 2023 | December 2023
Publication MM-Reasoner: A Multi-Modal Knowledge-Aware Framework for Knowledge-Based Visual Question Answering Mahmoud Khademi, Ziyi Yang, Felipe Vieira Frujeri, Chenguang Zhu 2023 Empirical Methods in Natural Language Processing | December 2023
Publication On decoder-only architecture for speech-to-text and large language model integration Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu Workshop of Automatic Speech Recognition and Understanding | December 2023
Publication A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability Jian Xue, Peidong Wang, Jinyu Li, Eric Sun Workshop of Automatic Speech Recognition and Understanding | December 2023
Publication Large Search Model: Redefining Search Stack in the Era of LLMs Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei SIGIR Forum | December 2023, Vol 57(2)
Publication Large-Scale Streaming End-to-End Speech Translation Jinyu Li December 2023 Invited Talk at NTU and SJTU
Publication Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong December 2023