Sherlock: A System for Interactive Summarization of Large Text CollectionsAvinesh P. V. S.; Carsten Binnig; Benjamin Hättasch; Christian M. Meyer; Orkan Özyurt
In: Proceedings of the VLDB Endowment (PVLDB), Vol. 11, No. 12, Pages 1902-1905, Association for Computing Machinery (ACM), 2018.
There exists an ever-growing set of data-centric systems that allow data scientists of varying skill levels to interactively manipulate, analyze and explore large structured data sets. However, there are currently not many systems that allow data scientists and novice users to interactively explore large unstructured text document collections from heterogeneous sources. In this demo paper, we present a new system for interactive text summarization called Sherlock. The task of automatically producing textual summaries is an important step to understand a collection of multiple topic-related documents. It has many real-world applications in journalism, medicine, and many more. However, none of the existing summarization systems allow users to provide feedback at interactive speed. We therefore integrate a new approximate summarization model into Sherlock that can guarantee interactive speeds even for large text collections to keep the user engaged in the process.