DFKI-LT - Accurat

Analysis and Evaluation of Comparable Corpora for Under-Resourced Areas of Machine Translation

The project aims at researching methods and techniques to overcome one of the central problems of machine translation (MT) – the lack of linguistic resources such as training data for under-resourced areas of machine translation. The main goal is to find, analyze and evaluate novel methods that exploit comparable corpora on order to compensate for the shortage of linguistic resources, and ultimately to significantly improve MT quality for under-resourced languages and narrow domains. Models generated from comparable corpora will be compared against baseline models generated from parallel corpora.

Funded by:European Union
Project Manager:Stephan Busemann (Stephan.Busemann@dfki.de)
Contact:Sabine Hunsicker (Sabine.Hunsicker@dfki.de)
Duration: 01.01.2010 - 30.06.2012
URL:http://www.accurat-project.eu/
Partners:LatviaTilde,
EnglandSheffield University,
EnglandUniversity of Leeds,
GreeceILSP: Institute for Language and Speech Processing, Athena,
HungaryUniversity of Zagreb,
RomaniaRACAI: Romanian Academy,
GermanyLinguatec,
SloveniaZementa