DFKI-LT - Integration of the Thesaurus for the Social Sciences (TheSoz) in an Information Extraction System

Thierry Declerck
Integration of the Thesaurus for the Social Sciences (TheSoz) in an Information Extraction System
in: Piroska Lendvai, Kalliopi Zervanou (eds.):
Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, Pages 90-95, Sofia, Bulgaria, Association for Computational Linguistics, 8/2013
 
We present current work dealing with the integration of a multilingual thesaurus for social sciences in an NLP framework for supporting Knowledge-Driven Information Extraction in the field of social sciences. We describe the various steps that lead to a running IE system: lexicalization of the labels of the thesaurus and semi-automatic generation of domain-specific IE grammars, with their subsequent implementation in a finite state engine. Finally, we outline the actual field of application of the IE system: analysis of social media for recognition of relevant topics in the context of elections.
 