DFKI-LT - Term Extraction and Mining Term Relations from Free-Text Documents in the Financial Domain
Proceedings of the 5th International Conference on Business Information Systems (BIS'02), April 24-25, 2002, Poznan, Poland
In this paper, we present an unsupervised hybrid text-mining approach to the automatic acquisition of domain-relevant terms and their relations. We deploy a TFIDF-based term classification method to acquire domain-relevant terms. Further, we apply two strategies to learn lexico-syntactic patterns that indicate paradigmatic and domain-relevant syntagmatic relations between the extracted terms. The first strategy uses GermaNet, while the second is based on different collocation acquisition methods suited to free-word-order languages like German. This domain-adaptive method yields good results even when trained on relatively small corpora. It can therefore be applied to information extraction and retrieval tasks within a real-world business information system.
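As an illustration of the TFIDF-based term acquisition step mentioned in the abstract, the following is a minimal sketch under stated assumptions, not the authors' implementation. It treats each corpus (one domain corpus plus contrast corpora) as a single "document", uses the textbook weighting tf * log(N / df), and keeps terms whose score in the domain corpus exceeds a threshold. The function names `tfidf_scores` and `domain_terms`, the toy data, and the threshold value are hypothetical; the abstract does not specify the paper's exact weighting or classification rule.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf-idf} dict per document.

    docs is a list of token lists. Uses relative term frequency and
    idf = log(N / df); this textbook variant is an assumption.
    """
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    scores = []
    for tokens in docs:
        tf = Counter(tokens)
        total = len(tokens)
        scores.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in tf.items()}
        )
    return scores


def domain_terms(docs, domain_index, threshold):
    """Hypothetical helper: terms of docs[domain_index] scoring above threshold,
    ordered from highest to lowest score."""
    scores = tfidf_scores(docs)[domain_index]
    selected = [t for t, s in scores.items() if s > threshold]
    return sorted(selected, key=lambda t: -scores[t])


# Toy data: index 0 plays the role of the financial corpus.
corpora = [
    ["bond", "yield", "the", "bond"],
    ["the", "match", "goal"],
    ["the", "recipe", "salt"],
]
financial_terms = domain_terms(corpora, domain_index=0, threshold=0.1)
```

In this toy run, "the" occurs in every corpus, so its idf is zero and it is filtered out, while "bond" and "yield" occur only in the financial corpus and are retained.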
Files: BibTeX, Bis2002.pdf