Skip to main content Skip to main navigation


Integrating Information Extraction and Automatic Hyperlinking

Stephan Busemann; Witold Drozdzynski; Hans-Ulrich Krieger; Jakub Piskorski; Ulrich Schäfer; Hans Uszkoreit; Feiyu Xu
In: Proceedings of the Interactive Posters/Demonstration at ACL-03. Annual Meeting of the Association for Computational Linguistics (ACL), Sapporo, Japan, Pages 117-120, 2003.


This paper presents a novel information system integrating advanced information extraction technology and automatic hyper-linking. Extracted entities are mapped into a domain ontology that relates concepts to a selection of hyperlinks. For information extraction, we use SProUT, a generic platform for the development and use of multilingual text processing components. By combining finite-state and unification-based formalisms, the grammar formalism used in SProUT offers both processing efficiency and a high degree of decalrativeness. The ExtraLink demo system showcases the extraction of relevant concepts from German texts in the tourism domain, offering the direct connection to associated web documents on demand.