A Machine-Learning Framework for Hybrid Machine Translation

Christian Federmann

In: Proceedings of the 35th German Conference on Artificial Intelligence. German Conference on Artificial Intelligence (KI-12) September 24-27 Saarbrücken Germany Springer 9/2012.


We present a Machine-Learning-based framework for hybrid Machine Translation. Our approach combines translation output from several black-box source systems. We define an extensible, total order on translation output and use this to decompose the n-best translations into pairwise system comparisons. Using joint, binarised feature vectors we train an SVM-based classifier and show how its classification output can be used to generate hybrid translations on the sentence level. Evaluations using automated metrics shows promising results. An interesting finding in our experiments is the fact that our approach allows to leverage good translations from otherwise bad systems as the combination decision is taken on the sentence instead of the corpus level. We conclude by summarising our findings and by giving an outlook to future work, e.g., on probabilistic classification or the integration of manual judgements.

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence