Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Speech Technology and Systems in Human-Machine Communication Li Deng, Kuansan Wang, Wu Chou IEEE Signal Processing Magazine | September 2005, Vol 22(5): pp. 12-14
Publication Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction Li Deng, Dong Yu, Alex Acero Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference
Publication Phonetic Transcription Verification with Generalized Posterior Probability Lijuan Wang, Yong Zhao, Min Chu, Frank Soong, Zhigang Cao INTERSPEECH 2005 | September 2005 INTERSPEECH 2005 Project
Publication Maximum Mutual Information SPLICE Transform for Seen and Unseen Conditions Jasha Droppo, Alex Acero Proc. Interspeech Conference | September 2005 Proc. Interspeech Conference
Publication Evaluation of a Long-Contextual-Span Hidden Trajectory Model and Phonetic Recognizer Using A* Lattice Search Dong Yu, Li Deng, Alex Acero Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference
Publication Refining Phoneme Segmentations Using Speaker-Adaptive Context Dependent Boundary Models Yong Zhao, Lijuan Wang, Min Chu, Frank Soong, Zhigang Cao INTERSPEECH 2005 | September 2005 Project
Publication Hidden Conditional Random Fields for Phone Classification John Platt, Asela Gunawardana, Milind Mahajan, Alex Acero International Conference on Speech Communication and Technology | September 2005 International Conference on Speech Communication and Technology
Publication Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering Dong Yu, Alex Acero IEEE Trans. Speech & Audio Proc (Special Issue on Data Mining of Speech, Audio and Dialog) | September 2005 Project Project
Publication Indexing Uncertainty for Spoken Document Search Ciprian Chelba, Alex Acero Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference
Publication A Graphical Model for Multi-Sensory Speech Processing in Air-and-Bone Conductive Microphones A. Subramanya, Jasha Droppo, Alex Acero, Zheng Zhang, Zicheng Liu Proc. of the Interspeech Conference | September 2005 Proc. of the Interspeech Conference