QT21

QT21: Quality Translation 21

A European Digital Single Market free of barriers, including language barriers, is a stated EU objective to be achieved by 2020. The findings of the META-NET Language White Papers show that currently only 3 of the EU-27 languages enjoy moderate to good support by our machine translation technologies, with either weak (at best fragmentary) or no support for the vast majority of the EU-27 languages. This lack is a key obstacle impeding the free flow of people, information and trade in the European Digital Single Market.

Many of the languages not supported by our current technologies show common traits: they are morphologically complex, with free and diverse word order. Often there are not enough training resources and/or processing tools. Together, these factors result in drastic drops in translation quality. The combined challenges of linguistic phenomena and resource scenarios have created a large and under-explored grey area in the language technology map of European languages. Combining support from key stakeholders, QT21 addresses this grey area by developing:

  • substantially improved statistical and machine-learning-based translation models for challenging languages and resource scenarios,
  • improved evaluation and continuous learning from mistakes, guided by a systematic analysis of quality barriers, informed by human translators,
  • all with a strong focus on scalability, to ensure that learning and decoding with these models is efficient and that reliance on data (annotated or not) is minimised.

To continuously measure progress, and to provide a platform for sharing and collaboration (within QT21 and beyond), the project revolves around a series of Shared Tasks. For maximum impact, these are co-organised with the annual workshops on machine translation (WMT).

The project QT21 has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 645452.

Partners

  • Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, Germany, Co-ordinator
  • Rheinisch-Westfälische Technische Hochschule Aachen, Germany
  • Universiteit van Amsterdam, Netherlands
  • Dublin City University, Ireland
  • The University of Edinburgh, United Kingdom
  • Karlsruher Institut für Technologie, Germany
  • Centre National de la Recherche Scientifique, France
  • Univerzita Karlova v Praze, Czech Republic
  • Fondazione Bruno Kessler, Italy
  • The University of Sheffield, United Kingdom
  • TAUS B.V., Netherlands
  • Text & Form GmbH, Germany
  • Tilde SIA, Latvia
  • Hong Kong University of Science and Technology, Hong Kong

Contact Person
Prof. Dr. Stephan Busemann

Publications about the project

Thierry Declerck, Rachele Sprugnoli

In: Antske Fokkens, Serge ter Braake, Ronald Sluijter, Paul Arthur, Eveline Wandl-Vogt (editors). Proceedings of the Second Conference on Biographical Data in a Digital World (BD-2017), November 6-7, Linz, Austria. Pages 76-82, CEUR Workshop Proceedings, volume 2119, ISSN 1613-0073, Aachen, 6/2018.

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz