Annotation of Entities and Relations in Spanish Radiology Reports

Viviana Cotik; Darío Filippo; Roland Roller; Hans Uszkoreit; Feiyu Xu

In: Galia Angelova; Kalina Bontcheva; Ruslan Mitkov; Ivelina Nikolova; Irina Temnikova (eds.). Proceedings of the International Conference Recent Advances in Natural Language Processing. International Conference on Recent Advances in Natural Language Processing (RANLP-2017), September 2-8, Varna, Bulgaria, INCOMA Ltd. Shoumen, Bulgaria, 9/2017.


Supervised machine learning methods are widely used for information extraction, but they are usually domain- and language-dependent. Training new classification models requires annotated data, and annotated data is also needed to evaluate information extraction algorithms. However, one major drawback of processing clinical data is the low availability of annotated datasets. For this reason, we manually annotated radiology reports written in Spanish. This paper presents the corpus, the annotation schema, the annotation guidelines, and further insights into the data.

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence