Publication

A Flexible XML-based Regular Compiler for Creation and Conversion of Linguistic Resources

Jakub Piskorski; Witold Drozdzynski; Feiyu Xu; Oliver Scherf
In: Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC'02). International Conference on Language Resources and Evaluation (LREC), 2002.

Abstract

Finite-state devices are widely used to model linguistic phenomena compactly, while regular expressions are regarded as an adequate level of abstraction for reasoning about finite-state languages. In this paper we present a flexible, XML-based, Unicode-compatible regular compiler for creating new linguistic resources and integrating existing ones. Our tool provides a user-friendly graphical interface that gives transparent control over the compilation process and allows generated finite-state grammars to be tested with several diagnostic tools. Through a direct database connection, existing linguistic resources can be converted into user-definable finite-state representations.