Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Robust access to large structured data using voice form-filling Sarangarajan Parthasarathy, Cyril Allauzen, R. Munkong Interspeech 2005 | September 2005
Publication Let’s Go Public! Taking a Spoken Dialog System to the Real World Antoine Raux, Brian Langner, Dan Bohus, Alan W Black, Maxine Eskenazi 9th European Conference on Speech Communication and Technology, Lisbon, Portugal | September 2005
Publication A Principled Approach for Rejection Threshold Optimization in Spoken Dialog Systems Dan Bohus, Alexander I. Rudnicky 9th European Conference on Speech Communication and Technology, Lisbon, Portugal | September 2005
Publication Sorry, I Didn’t Catch That! – An Investigation of Non-understanding Errors and Recovery Strategies Dan Bohus, Alexander I. Rudnicky 6th SIGdial Workshop on Discourse and Dialogue | September 2005
Publication Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction Li Deng, Dong Yu, Alex Acero Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference
Publication Speech Technology and Systems in Human-Machine Communication Li Deng, Kuansan Wang, Wu Chou IEEE Signal Processing Magazine | September 2005, Vol 22(5): pp. 12-14
Publication Phonetic Transcription Verification with Generalized Posterior Probability Lijuan Wang, Yong Zhao, Min Chu, Frank Soong, Zhigang Cao INTERSPEECH 2005 | September 2005 INTERSPEECH 2005 Project
Publication Maximum Mutual Information SPLICE Transform for Seen and Unseen Conditions Jasha Droppo, Alex Acero Proc. Interspeech Conference | September 2005 Proc. Interspeech Conference
Publication Evaluation of a Long-Contextual-Span Hidden Trajectory Model and Phonetic Recognizer Using A* Lattice Search Dong Yu, Li Deng, Alex Acero Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference
Publication Refining Phoneme Segmentations Using Speaker-Adaptive Context Dependent Boundary Models Yong Zhao, Lijuan Wang, Min Chu, Frank Soong, Zhigang Cao INTERSPEECH 2005 | September 2005 Project