Publikation

Context-Aware Neural Machine Translation Decoding

Eva Martínez Garcia, Carles Creus, Cristina España-Bonet

In: Fourth Workshop on Discourse in Machine Translation. Discourse in Machine Translation (DiscoMT-2019) befindet sich EMNLP-IJCNLP 2019 November 3 Hong Kong China Seiten 13-23 ACL 11/2019.

Abstrakt

This work presents a decoding architecture that fuses the information from a neural translation model and the context semantics enclosed in a semantic space language model based on word embeddings. The method extends the beam search decoding process and therefore can be applied to any neural machine translation framework. With this, we sidestep two drawbacks of current document-level systems: (i) we do not modify the training process so there is no increment in training time, and (ii) we do not require document-level an-notated data. We analyze the impact of the fusion system approach and its parameters on the final translation quality for English–Spanish. We obtain consistent and statistically significant improvements in terms of BLEU and METEOR and we observe how the fused systems are able to handle synonyms to propose more adequate translations as well as help the system to disambiguate among several translation candidates for a word.

Projekte

Weitere Links

contextDec_DiscoMT.pdf (pdf, 418 KB)

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence