Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Comparison of Sentential-Stress Allocation within Base Phrase among Different Reading Styles Min Chu, Mingzhen Bao Proc. of International Conference on Speech Prosody | March 2004
Publication A Hybrid Approach to Rendering Handwritten Characters Sara L. Su, Chenyu Wu, Ying-Qing Xu, Heung-Yeung Shum Proceedings of WSCG | February 2004
Publication Initial Development of a Voice-Activated Astronaut Assistant for Procedural Tasks: From Need to Concept to Prototype Gregory Aist, Dan Bohus, Brad Boven, Ellen Campana, Susana Early, Steven Phan Interactive Instruction Development | January 2004, Vol 16(3): pp. 32-36
Publication Error Awareness and Recovery in Task-Oriented Spoken Dialogue Systems Dan Bohus | January 2004
Publication Advances in Large Vocabulary Speech Recognition Geoffrey Zweig MSR-TR-2004-154 | January 2004 Advances in Computers, Elsevier Science
Publication Arc Minimization in Finite State Decoding Graphs with Cross-Word Decoding Context Geoffrey Zweig MSR-TR-2004-153 | January 2004 Computer Speech and Language. Vol. 18, 2004
Publication Use of metadata to improve recognition of spontaneous speech and named entities Bhuvana Ramabhadran, Olivier Siohan, Geoffrey Zweig In Proceedings of ICSLP | January 2004
Publication Speech Recognition Error Analysis on the English MALACH Corpus Olivier Siohan, Bhuvana Ramabhadran, Geoffrey Zweig In Proceedings of ICSLP | January 2004
Publication A study on the effects of limited training data for English, Spanish and Indonesian keyword spotting Kit Thambiratnam, T. Martin, S. Sridharan Proceedings of the 10th Australian International Conference on Speech Science and Technology (SST) | January 2004