DFKI-LT - Statistical Machine Transliteration with Multi-to-Multi Joint Source Channel Model
Statistical Machine Transliteration with Multi-to-Multi Joint Source Channel Model
3 Proceedings of the Named Entities Workshop Shared Task on Machine Transliteration, Chiang Mai, Thailand, Association for Computational Linguistics, 11/2011
This paper describes DFKI's participation in the NEWS2011 shared task on machine transliteration. Our primary system participated in the evaluation for English-Chinese and Chinese-English language pairs. We extended the joint source-channel model on the transliteration task into a multi-to-multi joint source-channel model, which allows alignments between substrings of arbitrary lengths in both source and target strings. When the model is integrated into a modified phrase-based statistical machine translation system, around 20% of improvement is observed. The primary system achieved 0.320 on English-Chinese and 0.133 on Chinese-English in terms of top-1 accuracy.
Files: BibTeX, NEWS2011.pdf