Designing the Dialogue Component in a Speech Translation System - a Corpus Based Approach

Jan Alexandersson, Norbert Reithinger

In: Proceedings of the 9th Twente Workshop on Language Technology (Corpus Based Approaches to Dialogue Modelling), TWLT-1995, December 6-8, 1995, Twente, Netherlands.


New and challenging requirements arise for the dialogue processing component in the speech-to-speech translation system Verbmobil. It has to cope with unexpected and vague input as well as with gaps in the input. The design is based on a large corpus of transliterated dialogues, which provides the data to start from. A careful analysis of this corpus and of the requirements from other components of Verbmobil resulted in a hybrid approach that combines knowledge-based and statistical processing. In this paper, we present the design process and the resulting architecture. Using the corpus, we ran various experiments to evaluate the first design of the component.

This work was funded by the German Federal Ministry for Research and Technology (BMBF) in the framework of the Verbmobil Project under Grant 01IV101K/1. The responsibility for the contents of this study lies with the authors.

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence