Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Integrating Multiple Knowledge Sources for Utterance-Level Confidence Annotation in the CMU Communicator Spoken Dialog System Dan Bohus, Alexander I. Rudnicky CMU-CS-02-190 | November 2002 University of Washington Computer Science & Engineering Technical Report
Publication A System for Spoken Query Information Retrieval on Mobile Devices Eric Chang, Frank Seide, Helen M. Meng, Zhuoran Chen, Yu Shi, Yuk-Chi Li IEEE Transactions on Speech and Audio Processing | October 2002, Vol 10(8): pp. 531-541
Publication Automatic Speech Recognition for Wireless Mobile Devices. Richard C. Rose, Sarangarajan Parthasarathy September 2002
Publication Unsupervised speaker segmentation of telephone conversations. Aaron E. Rosenberg, Allen Gorin, Zhu Liu, Sarangarajan Parthasarathy ICSLP 2002 | September 2002
Publication A Multi-Class Approach for Modelling Out-of-Vocabulary Words Proc. Int. Conf. on Spoken Language Processing | September 2002 Proc. Int. Conf. on Spoken Language Processing
Publication Log-Domain Speech Feature Enhancement Using Sequential MAP Noise Estimation and a Phase-sensitive Model of the Acoustic Environment Li Deng, Jasha Droppo, Alex Acero Proc. International Conference on Spoken Language Processing | September 2002 Proc. International Conference on Spoken Language Processing
Publication Overcoming Language Barriers in the Internet Era – A Foreign Language Reading Assistance System Hang Li, Yunbo Cao, Cong Li MSR-TR-2002-91 | September 2002
Publication A syllable-based approach for improved recognition of spoken names Abhinav Sethy, Shrikanth Narayanan, Sarangarajan Parthasarathy Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology | September 2002