DFKI-LT - Linguistics to Structure Unstructured Information

Günter Neumann, Gerhard Paaß, David van den Akker
Linguistics to Structure Unstructured Information
In: Towards the Internet of Services: The THESEUS Program, pages 383-392, Springer, 2014
 
Extracting semantics from unstructured documents requires recognizing and classifying textual patterns, their variability, and their interrelationships, i.e. analyzing the linguistic structure of the documents. As an integral part of a larger real-life application, this linguistic analysis must be robust, fast, and adaptable, which poses a major challenge for developing the necessary linguistic base components. In this drill-down we present several dimensions of this challenge and show how they have been tackled successfully in ORDO.
 