Skip to main content Skip to main navigation



Technologies for Advanced Knowledge Extraction

Technologies for Advanced Knowledge Extraction

  • Duration:

The project TAKE aims to adapt, develop and utilize a range of language and knowledge technologies for the gradual automatic extraction of knowledge from the World Wide Web. Rule-based and statistical methods for language processing will be combined for systematically extending a body of formalized knowledge.

The central technology for this endeavor is semantically driven advanced information extraction, especially relation extraction, i.e., the detection of instances of semantic relations in large volumes of texts. Such relevant relations may belong to several classes such as facts, definitions, events, citations and opinions.

In TAKE, information extraction is not viewed as a pragmatic shortcut to getting at least something out of natural language texts but rather as a method for gradually approaching the unsolved problem of text understanding in a systematic and controlled way.

Existing bodies of formalized linguistic knowledge such as lexicons, morphologies and grammars will be utilized as well as tools for statistical processing.

The developed methods, architectures and systems will be tested and demonstrated in two knowledge domains:

  • scientific/technological literature in a selected field of research, i.e., language technology, and
  • general biographical texts.

TAKE is funded under contract 01IW08003.


BMBF - Federal Ministry of Education and Research

BMBF - Federal Ministry of Education and Research

Publications about the project

Ulrich Schäfer; Bernd Kiefer; Christian Spurk; Jörg Steffen; Rui Wang; Benjamin Weitz; Magdalena Wolska

In: LIBER quarterly, Vol. 22, No. 4, Pages 285-309, Association of European Research Libraries, 2/2013.

To the publication

Feiyu Xu; Hans Uszkoreit; Hong Li; Peter Adolphs; Xiwen Cheng

In: Hermann Friedrich; Hans-Joachim Grallert; Wolfgang Wahlster; Stefan Wess; Thomas Widenka (Hrsg.). Theseus-Buch. Chapter X, Springer, 2013.

To the publication

Ulrich Schäfer; Christian Spurk; Jörg Steffen

In: Proceedings of the 24th International Conference on Computational Linguistics. International Conference on Computational Linguistics (COLING-2012), December 10-14, Mumbai, India, Pages 1059-1070, ICCL, 12/2012.

To the publication