Publikation

Collection and Curation of Language Data within the European Language Resource Coordination (ELRC)

Andrea Lösch, Valérie Mapelli, Khalid Choukri, Maria Giagkou, Stelios Piperidis, Prokopis Prokopidis, Vassilis Papavassiliou, Miltos Deligiannis, Aivars Berzins, Andrejs Vasiljevs, Eileen Marra, Thierry Declerck, Josef van Genabith

In: Adrian Paschke , Georg Rehm , Jamal Al Qundus , Clemens Neudecker , Lydia Pintscher (Hrsg.). Proceedings of the Conference on Digital Curation Technologies (Qurator 2021). Conference on Digital Curation Technologies (QURATOR-2021) February 8-12 Berlin Germany EUR-WS.org 2021.

Abstrakt

In order to help improve the quality, coverage and performance of au- tomated translation solutions for current and future Connecting Europe Facility (CEF) digital services, the European Language Resource Coordination (ELRC) was set up in 2015 through a service contract operating under the European Com- mission’s CEF SMART 2014/1074 programme. Since then, ELRC initiated a number of actions to support the collection of Language Resources (LRs) within the public sector in EU member and CEF-affiliated countries. All resources shared by the contributors were gathered and curated in the ELRC-SHARE Re- pository, after having passed the validation process developed by ELRC. This paper provides insights into the overall data collection and curation process (in- cluding both technical and legal validation of resources) employed within ELRC. The ELRC Helpdesk provides both technical and legal guidance (e.g. Intellectual Property Rights (IPR) clearance support) to potential data contributors, thus en- abling the sustainable sharing of language data.

Weitere Links

qurator2021_paper_6.pdf (pdf, 350 KB )

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence