Harmonization of German Lexical Resources for Opinion Mining

Thierry Declerck, Hans-Ulrich Krieger

In: Nicoletta Calzolari , Khalid Choukri , Thierry Declerck , Hrafn Loftsson , Bente Maegaard , Joseph Mariani , Jan Odijk , Stelios Piperidis (editor). Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC-2014). International Conference on Language Resources and Evaluation (LREC-14) May 28-30 Reykjavik Ireland ELRA Paris 5/2014.


We present on-going work on the harmonization of existing German lexical resources in the field of opinion and sentiment mining. The input of our harmonization effort consisted in four distinct lexicons of German word forms, encoded either as lemmas or as full forms, marked up with polarity features, at distinct granularity levels. We describe how the lexical resources have been mapped onto each other, generating a unique list of entries, with unified Part-of-Speech information and basic polarity features. Future work will be dedicated to the comparison of the harmonized lexicon with German texts annotated with polarity information. We are further aiming at both linking the harmonized German lexical resources with similar resources in other languages and publishing the resulting set of lexical data in the context of the Linguistic Linked Open Data cloud.


HarmonizationOfGermanLexcialResourcesOM_final_LREC2014.pdf (pdf, 318 KB )

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz