DFKI-LT - Towards Deeper MT: Parallel Treebanks, Entity Linking, and Linguistic Evaluation

Ankit Srivastava, Vivien Macketanz, Aljoscha Burchardt, Eleftherios Avramidis
Towards Deeper MT: Parallel Treebanks, Entity Linking, and Linguistic Evaluation
4 Proceedings of The Workshop on Deep Language Processing for Quality Machine Translation, Varna, Bulgaria, Institute of Information and Communication Technologies Bulgarian Academy of Sciences, 2016
 
In this paper we investigate techniques to enrich Statisti- cal Machine Translation (SMT) with automatic deep linguistic tools and evaluate with a deeper manual linguistic analysis. Using English–German IT-domain translation as a case-study, we exploit parallel treebanks for syntax-aware phrase extraction and interface with Linked Open Data (LOD) for extracting named entity translations in a post decoding frame- work. We conclude with linguistic phenomena-driven human evaluation of our forays into enhancing the syntactic and semantic constraints on a phrase-based SMT system
 
Files: BibTeX, deeperMT.pdf