DFKI-LT - Can Machine Learning Algorithms Improve Phrase Selection in Hybrid Machine Translation?
Can Machine Learning Algorithms Improve Phrase Selection in Hybrid Machine Translation?
1 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics,
We describe a substitution-based, hybrid machine translation (MT) system that has been extended with a machine learning component controlling its phrase selection. Our approach is based on a rule-based MT (RBMT) system which creates template translations. Based on the generation parse tree of the RBMT system and standard word alignment computation, we identify potential translation snippets from one or more translation engines which could be substituted into our translation templates. The substitution process is controlled by a binary classifier trained on feature vectors from the different MT engines. Using a set of manually annotated training data, we are able to observe improvements in terms of BLEU scores over a baseline version of the hybrid system.
Files: BibTeX, eacl12_cfedermann.pdf