DFKI-LT - Qualitative Evaluation and Error Analysis of Phonetic Segmentation

Arif Khan, Ingmar Steiner
Qualitative Evaluation and Error Analysis of Phonetic Segmentation
in: Jürgen Trouvain, Ingmar Steiner, Bernd Möbius (eds.):
28th Conference on Electronic Speech Signal Processing (ESSV), Pages 138-144, Saarbrücken, Germany, TUD Press, Dresden, 3/2017
 
Speech segmentation is the process of identifying the boundaries between units of speech, i.e., words, syllables, and phones, and splitting the signal at those boundaries. This paper focuses on the automatic phonetic segmentation of speech and the methods used for its evaluation. We explain the current methods used for the evaluation of speech segmentation and highlight the details that have not been sufficiently addressed in the literature. Several metrics are explained for analysis. The phones are grouped into several classes and the phone class transitions are observed. We found that most of the errors come from class transitions that are also difficult for humans to segment.
 
Files: BibTeX, Khan.pdf