Publication

Towards the Harmonization and Segmentation of German Hashtags

Thierry Declerck, Piroska Lendvai

In: Stefanie Dipper (editor). Proceedings of the 3rd Workshop on Natural Language Processing for Computer-Mediated Communication. Workshop on Natural Language Processing for Computer-Mediated Communication (NLP4CMC-16) located at 13th Conference on Natural Language Processing (“Konferenz zur Verarbeitung natürlicher Sprache”, KONVENS) September 22 Bochum Germany ISBN ISSN 2190-0949 Bochumer Linguistische Arbeitsberichte Bichum 9/2016.

Abstract

We present on-going work on the harmonization and segmentation of German hashtags. Our aim is to reduce the number of variants of hashtags expressing the same content to one harmonized hashtag that can thus serve as a unique “annotation tag” for a large set of tweets.

Projekte

Hashtags_CMC_2016.pdf (pdf, 182 KB)

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz