Streaming Text Analytics for Real-time Event RecognitionPhilippe Thomas; Johannes Kirschnick; Leonhard Hennig; Renlong Ai; Sven Schmeier; Holmer Hemsen; Feiyu Xu; Hans Uszkoreit
In: Proceedings of the International Conference Recent Advances in Natural Language Processing. International Conference on Recent Advances in Natural Language Processing (RANLP-17), September 4-6, Varna, Bulgaria, tbd, 9/2017.
A huge body of continuously growing written knowledge is available on the web. Real-time information extraction from such high velocity, high volume text streams requires scalable, distributed natural language processing pipelines. We introduce such a system for fine-grained event recognition within the Big Data framework Flink, and demonstrate its capabilities for extracting and localizing mobility- and industry-related events from heterogeneous text sources. Performance analyses conducted on several large datasets show that our system achieves high throughput and maintains low latency. We also present promising experimental results for the event extraction component of our system, which recognizes a novel set of event types. The demo system is available at sta-demo.appspot.com.