Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication The Adaptation Of A Machine-Learned Sentence Realization System To French Martine Smets, Michael Gamon, Simon Corston-Oliver, Eric Ringger April 2003
Publication An Expectation-Maximization Approach for Formant Tracking using a Parameter-free Nonlinear Predictor Issam Bazzi, Alex Acero, Li Deng Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing | April 2003
Publication Incremental Bayes Learning with Prior Evolution for Tracking Non-Stationary Noise Statistics from Noisy Speech Data Li Deng, Jasha Droppo, Alex Acero Proc. ICASSP | April 2003
Publication Speech Error Correction: The Story of the Alternates List Kevin Larson, David Mowatt International Journal of Speech Technology | March 2003, Vol 6(2): pp. 183-194
Publication ProAlign: Shared Task Description Dekang Lin, Colin Cherry Proceedings of the HLT/NAACL 2003 Workshop on Building and Using Parallel Texts | January 2003
Publication Coarticulation Modeling by Embedding a Target-Directed Hidden Trajectory Model into HMM-Model and Training Frank Seide, Li Deng, Jianlai Zhou Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) | January 2003
Publication Toward Domain-Independent Conversational Speech Recognition Brian Kingsbury, Lidia Mangu, George Saon, Geoffrey Zweig, Scott Axelrod, Vaibhava Goel, Karthik Visweswariah, Michael Picheny Proceedings of Eurospeech | January 2003
Publication Learning with Knowledge from Multiple Experts Matthew Richardson, Pedro Domingos Proceedings of the Twentieth International Conference on Machine Learning | January 2003 Proceedings of the Twentieth International Conference on Machine Learning
Publication Phonetic Class-Based Speaker Verification Matthieu Hebert, Larry Heck Eurospeech 2003 – Interspeech 2003 | January 2003