Generating Complex Morphology for Machine Translation

Einat Minkov; Kristina Toutanova; Hisami Suzuki

Proceedings of ACL | June 2007

Published by Association for Computational Linguistics

We present a novel method for predicting inflected word forms for generating morphologically rich languages in machine translation. We utilize a rich set of syntactic and morphological knowledge sources from both source and target sentences in a probabilistic model, and evaluate their contribution in generating Russian and Arabic sentences. Our results show that the proposed model substantially outperforms the commonly used baseline of a trigram target language model; in particular, the use of morphological and syntactic features leads to large gains in prediction accuracy. We also show that the proposed method is effective with a relatively small amount of data.