Variable Voice Likability Affecting Subjective Speech Quality Assessments

Laura Fernández Gallardo, Gabriel Mittag, Sebastian Möller, John Beerends

In: 2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX). International Conference on Quality of Multimedia Experience (QoMEX-2018) May 29-June 1 Cagliari Italy Seiten 15-30 ISBN 978-1-5386-2605-4 IEEE 2018.


In telephone conversations, transmitted speech of good to excellent quality is desired for enhanced Quality of Experience and to sustain lasting customer loyalty. Subjective mean opinion scores account for perceived transmitted quality, while instrumental models, such as POLQA, are able to estimate the subjective judgments. To perform subjective or instrumental quality measurements, the International Telecommunication Union recommends to employ two sentences from both, male and female speakers as speech material. In this paper, we have examined whether subjective and instrumental MOS ratings are affected by perceptual voice likability. A listening test has been conducted over 8 degradations with 12 extremely likable and unlikable male and female speakers. Statistically significant effects of gender and of voice likability have been detected on subjective MOS, whereas instrumental MOS was only affected by gender differences. These results can contribute to further improvements needed in the POLQA perceptual modeling, as well as to the selection of speakers for speech quality assessment tests.

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence