Towards the Generation of Semantically Enriched Multilingual Components of Ontology Labels

Thierry Declerck, Dagmar Gromann

In: Paul Buitelaar, Philipp Cimiano, David Lewis, James Pustejovsky, Felix Sasaki (eds.). Proceedings of the 3rd International Workshop on the Multilingual Semantic Web (MSW3). 3rd International Workshop on the Multilingual Semantic Web (MSW-12), located at ISWC 2012, November 11, Boston, MA, United States, pages 11-23, CEUR Workshop Proceedings (CEUR) 936, ISSN 1613-0073, CEUR, Aachen, 11/2012.


Ontologies often contain multilingual textual information in annotation properties, such as rdfs:label and rdfs:comment. While the motivation for using such annotation properties is to provide a human-readable description of the abstract conceptualization of a domain, we observe that appropriate natural language use and representation are often neglected. The same holds for other resources on the Web, such as multilingual taxonomies. Terms often lack consistency and completeness, which also hampers accurate automated natural language processing of such text. We propose a pattern-based transformation of terms in labels, which also supports a multilingual alignment of (sub)components of labels. The source data for our approach is an ontology that we derived from an industry classification taxonomy, improved with regard to consistency and completeness, and applied to the process of lexicalization.


Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence