Hybrid Machine Translation Architectures within and beyond the EuroMatrix project

Proceedings of the Twelfth Conference of the European Association for Machine Translation |

Published by HITEC e.V.

This paper presents two hybrid architectures combining rulebased
and statistical machine translation (RBMT and SMT) technology.
In the rst case, several existing MT engines are combined in a multiengine
setup, and a decoder for SMT is used to select and combine the
best expressions proposed by di erent engines. The other architecture
uses lexical entries found using a combination of SMT technology with
shallow linguistic processing, which are then included in a rule-based system.
The rst architecture has been implemented in the framework of the
EuroMatrix project, where results from the recent evaluation campaign
give important insights into the strengths and weaknesses of the approach
relative to other participating RBMT and SMT systems. The second architecture
has been developed in collaboration with the European Patent
O ce who uses the enhanced RBMT system in the framework of their
European Machine Translation Project.