Evaluation of the NLP Components of an Information Extraction System for German

Thierry Declerck, Judith Klein, Günter Neumann

In: Proceedings of the 1st International Conference on Language Resources and Evaluation (LREC-98), May 28-30, Granada, Spain, Pages 293-297, 1, 1998.


This paper describes ongoing work on the evaluation of the NLP components of the core engine of smes (Saarbrücker Message Extraction System), which consists of a tokenizer, an efficient and robust German morphology component, a part-of-speech (POS) tagger, a shallow parsing module, a linguistic knowledge base, and an output construction component. The morphology component, the tagger, and one parsing module (the NP grammar) are currently under evaluation, each at a different stage of progress. We present the methodology used and the results obtained so far.

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz