Towards a Sense-based Access to Related Online Lexical Resources

Thierry Declerck, Karlheinz Mörth

In: Tinatin Margalitadze , George Meladze (editor). Proceedings of the XVII EURALEX International Congress. EURALEX International Congress (EURALEX-16) Lexicography and Linguistic Diversity September 6-10 Tbilissi Georgia Pages 660-668 ISBN 978-9941-13-542-2 Ivane Javakhishvili Tbilisi University Press Tbilissi 9/2016.


We present an approach aiming at a method to support sense-based cross-dialectal – and cross-lexicon – access to dictionaries of spoken Arabic varieties. The original lexical data consists of three TEI P5 encoded dictionaries describing varieties spoken in Cairo, Damascus and Tunis. This data is included in the Vienna Corpus of Arabic Varieties (VICAV). We briefly present this data, before summarizing the TEI approach for encoding senses in lexical resources. We discuss certain issues related to the TEI representation when it comes to the possibility to provide for a sense-based access to the lexical data. We investigate the use of the recent W3C On tology-Lexicon Community Group (OntoLex) modeling work and present first re sults of the mapping of the TEI encoded data onto its ontolex model and show how this new representation format can efficiently support cross-dictionary sense-based access.


Proceedings_EURALEX2016_Declerck_Moerth_104_Final.pdf (pdf, 369 KB )

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz