A Flexible XML-based Regular Compiler for Creation and Converting Linguistic Resources

Jakub Piskorski, Witold Drozdzynski, Feiyu Xu, Oliver Scherf

In: Proceedings of the 3rd International Conference on Language Resources an Evaluation (LREC'02). International Conference on Language Resources and Evaluation (LREC) 2002.


Finite-state devices are widely used to compactly model linguistic phenomena, whereas regular expressions are regarded as the adequate level of abstraction for thinking about finite-state languages. In this paper we present a flexible XML-based and Unicode-compatible regular compiler for creating, and integrating existing linguistic resources. Our tool provides user-friendly graphical interface which enables the transparent control of the compilation process and allows for testing generated finite-state grammars with several diagnostic tools. Through the direct database connection, existing linguistic resources can be converted into user-definable finite-state representations.

regular_compiler.pdf (pdf, 967 KB )

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz