Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Studies in Massively Speaker-Specific Speech Recognition Yu Shi, Eric Chang IEEE | May 2004
Publication Refining Segmental Boundaries for TTS database Using Fine Contextual-Dependent Boundary Models Lijuan Wang, Yong Zhao, Min Chu, Jianlai Zhou, Zhigang Cao May 2004
Publication Logistic Discriminative Speech Detection using Posterior SNRs Arun C. Surendran, Somsak Sukuttanon, John Platt May 2004
Publication Noise Robust Speech Recognition with a Switching Linear Dynamic Model Jasha Droppo, Alex Acero Proc. ICASSP | May 2004 Access
Publication Convolutional Networks for Speech Detection Somsak Sukittanon, Arun C. Surendran, John Platt, Chris J.C. Burges International Speech Communication Association | May 2004
Publication Capturing Long Distance Dependency in Language Modeling: An Empirical Study Jianfeng Gao, Hisami Suzuki International Conference on Natural Language Processing | May 2004
Publication Custom Arithmetic for High-speed, Low-resource ASR Systems Jonathan Malkin, Xiao Li, Jeff Bilmes IEEE International Conference on Acoustic, Speech and Signal Processing | May 2004
Publication Tone Articulation Modeling for Mandarin Spontaneous Speech Recognition Jian-lai Zhou, Ye Tian, Yu Shi, Chao Huang, Eric Chang IEEE | May 2004
Publication What’s in a translation rule? Michel Galley, Mark Hopkins, Kevin Knight, Daniel Marcu Proc. of HLT-NAACL | May 2004