DFKI-LT - The SAMMIE corpus of multimodal dialogues with an MP3 player

Ivana Kruijff-Korbayová, Tilman Becker, Nate Blaylock, Ciprian Gerstenberger, Michael Kaisser, Peter Poller, Verena Rieser, Jan Schehl
The SAMMIE corpus of multimodal dialogues with an MP3 player
1 Proceedings of The 5th Language Resources and Evaluation Conference, Pages 2018-2023, Genoa, Italy, ELDA, 2006
 
We describe a corpus of multimodal dialogues with an MP3player collected in Wizard-of-Oz experiments and annotated with a richfeature set at several layers. We are using the Nite XML Toolkit (NXT) to represent and further process the data. We designed an NXTdata model, converted experiment log file data and manualtranscriptions into NXT, and are building tools for additionalannotation using NXT libraries. The annotated corpus will be used to (i) investigate various aspects of multimodal presentation andinteraction strategies both within and across annotation layers; (ii) design an initial policy for reinforcement learning of multimodalclarification requests.
 
Files: BibTeX, 704.html, 704_pdf.pdf