DFKI-LT - Bootstrapping an Ontology-based Information Extraction System.

Alexander Maedche, GŁnter Neumann, Steffen Staab
Bootstrapping an Ontology-based Information Extraction System.
in: Piotr S. Szczepaniak, Javier Segovia, Janusz Kacprzyk, Lotfi A. Zadeh (eds.):
1 Intelligent Exploration of the Web volume 111,
Studies in Fuzziness and Soft Computing, Pages 345-359, Springer/Physica-Verlag GmbH, Heidelberg, 2003

Automatic intelligent web exploration will benefit from shallow information extraction techniques if the latter can be brought to work within many different domains. The major bottleneck for this, however, lies in the so far difficult and expensive modeling of lexical knowledge, extraction rules, and an ontology that together define the information extraction system. In this paper we present a bootstrapping approach that allows for the fast creation of an ontology-based information extracting system relying on several basic components, viz. a core information extraction system, an ontology engineering environment and an inference engine. We make extensive use of machine learning techniques to support the semi-automatic, incremental bootstrapping of the domain-specific target information extraction system.
Files: BibTeX