Publikation
Beyond Translation Memories: Generating Translation Suggestions based on Parsing and POS Tagging
Tapas Nayak; Santanu Pal; Sudip Kumar Naskar; Sivaji Bandyopadhyay; Josef van Genabith
In: In the Proceedings of the 2nd Workshop on Natural Language Processing for Translation Memories. Workshop on Natural Language Processing for Translation Memories (NLP4TM-2016), 2nd, May 28, Portoroz, Slovenia, 2016.
Zusammenfassung
This paper explores how translations of unmatched parts of an input sentence can be
discovered and inserted into Translation Memory (TM) suggestions generated by a
Computer Aided Translation (CAT) tool using a parse tree and part of speech (POS) tags to
form a new translation which is more suitable for post-editing. CATaLog (Nayek et al., 2015)
is a CAT tool based on TM and a modified Translation Error Rate (TER)(Snover et al., 2006)
metric. Unmatched parts of the sentence to be translated can often be found in some other
TM suggestions or in sentences which are not part of TM suggestions. Therefore, we can
find the translations of those unmatched parts within the TM database itself. If we can merge
the translations of the unmatched parts into one single sentence in a meaningful way, then
post-editing effort will be reduced.