Abstract

This work investigates the use of acoustic data to improve grapheme-to-phoneme conversion for name recognition. We introduce a joint model of acoustics and graphonemes, and present two approaches to adapting graphoneme model parameters: maximum likelihood training and discriminative training. Experiments on a large-scale voice-dialing system show that the maximum likelihood approach yields a relative 7% reduction in sentence error rate (SER) compared to the best baseline result we obtained without leveraging acoustic data, while discriminative training enlarges the relative SER reduction to 12%.