DFKI-LT - Shallow Processing with Unification and Typed Feature Structures - Foundations and Applications
Künstliche Intelligenz, volume 1, number 1
We present SProUT, a platform for developing multilingual shallow text processing systems. A SProUT grammar consists of a set of rules. The left-hand side of each rule is a regular expression over typed feature structures (TFSs) and serves as the recognition pattern; the right-hand side is a TFS that specifies the form of the output structure. The reusable core components of SProUT are a finite-state machine toolkit, a regular expression compiler, a finite-state machine interpreter, a typed feature structure package, and a set of linguistic processing resources. We also present several applications that use SProUT. The system is implemented in Java and C(++) and runs under both MS Windows and Linux.
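The rule format described above can be illustrated with a small sketch. It is not SProUT's grammar formalism or API: plain Python dicts stand in for typed feature structures (with no type hierarchy), a fixed sequence of patterns stands in for a full regular expression, and the names `unify`, `apply_rule`, and `build_person` are hypothetical. It only shows the general idea of matching a left-hand-side pattern by unification and building a right-hand-side output structure from the match.

```python
# Illustrative sketch only. Assumptions: dicts approximate feature structures,
# atomic values are strings, and the "regular expression" is a plain sequence.

def unify(a, b):
    """Return the unification of two feature structures, or None on a clash."""
    if isinstance(a, dict) and isinstance(b, dict):
        result = dict(a)
        for key, value in b.items():
            if key in result:
                merged = unify(result[key], value)
                if merged is None:
                    return None  # incompatible values under the same feature
                result[key] = merged
            else:
                result[key] = value
        return result
    # Atomic values unify only if they are equal.
    return a if a == b else None


def apply_rule(patterns, build_output, tokens):
    """Try the pattern sequence at every token position; collect the outputs."""
    outputs = []
    n = len(patterns)
    for start in range(len(tokens) - n + 1):
        window = tokens[start:start + n]
        matched = [unify(p, t) for p, t in zip(patterns, window)]
        if all(m is not None for m in matched):
            outputs.append(build_output(matched))
    return outputs


# Hypothetical input: one feature structure per token.
tokens = [
    {"surface": "met", "pos": "verb"},
    {"surface": "Dr.", "pos": "title"},
    {"surface": "Smith", "pos": "noun", "case": "capitalized"},
]

# Left-hand side: a title followed by a capitalized noun.
person_patterns = [{"pos": "title"}, {"pos": "noun", "case": "capitalized"}]


# Right-hand side: the output structure built from the matched tokens.
def build_person(matched):
    return {
        "type": "person",
        "title": matched[0]["surface"],
        "surname": matched[1]["surface"],
    }


people = apply_rule(person_patterns, build_person, tokens)
print(people)
```

In this sketch the `build_person` function copies values from the match into the output; a unification-based formalism would typically express that link declaratively inside the rule instead.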
Files: BibTeX, sprout-web.pdf