Proceedings-Artikel

Extraction of Multilingual Term Variants in the Business Reporting Domain

Thierry Declerck; Dagmar Gromann
In: Tatiana Gornostay (Hrsg.). Proceedings of CHAT 2012 The 2nd Workshop on the Creation, Harmonization and Application of Terminology Resources. Workshop on the Creation, Harmonization and Application of Terminology Resources (CHAT-12), located at TKE 2012, June 22, Madrid, Spain, Pages 41-47, ISBN 1650-3740, Linköping University Electronic Press, Linköping , 6/2012.

Abstract

Within the context of the European research project "Monnet", which implements among other activities ontology-based multilingual information extraction, we tackle the the issue of recognizing variants of concept labels in business reports that guide the information extraction process. In this short paper, we describe two related experiments in nding variants of multilingual taxonomy labels used in business reporting { across distinct reporting legislations and languages. A core taxonomy developed by the XBRL-Europe Association provides a starting point, as we map multilingual term variant candidates we extract from the web presence of relevant players in the eld of business reporting to its labels.

Projekte

MONNET
TrendMiner

Weitere Links

BibTeX

CHAT 2012 Proceedings.pdf