PArametrisation of prosody and VOice QUality for concatenative speech synthesis in view of Emotion expression

A major obstacle to the acceptance of speech synthesis is its lack of expressivity. To convey emotions or other expressions appropriately, the sound of the synthetic voice needs to change; however, newer speech synthesis methods do not allow the relevant parameters to be controlled to the necessary extent.

In current speech synthesis technology, naturalness and flexibility are mutually exclusive: newer corpus-based unit selection synthesis methods often sound natural, but they can only realise a single speaking style, which is determined during the recordings of the speech corpus. In contrast, older methods such as formant or diphone synthesis are parametrisable but sound quite unnatural. There is currently no synthesis method combining the naturalness of corpus-based synthesis with the parametrisability of earlier systems.

The PAVOQUE project aims to make a core contribution to reconciling synthesis quality and parametrisability. Working within a current corpus-based speech synthesis system, it investigates methods for parametrising the key carriers of vocal emotion expression: prosody (intonation and rhythm) and voice quality. Two strategies are pursued: parameter-based selection of units from the corpus, and post-processing of the synthetic speech signal with signal manipulation methods. Together, these are intended to allow a high degree of expressivity while maintaining good quality of the speech signal.


DFG - German Research Foundation

Publications about the project

Ingmar Steiner; Marc Schröder; Annette Klepp

In: Phonetik & Phonologie 9 (P&P-9), October 11-12, Zurich, Switzerland, Pages 83-84, Peter Lang, 10/2013.

Marc Schröder; Marcela Charfuelan Oliva; Sathish Pammi; Ingmar Steiner

In: 12th Annual Conference of the International Speech Communication Association (INTERSPEECH-2011), August 28-31, Florence, Italy, ISCA, 8/2011.

Sathish Chandra Pammi; Marcela Charfuelan Oliva; Marc Schröder

In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), May 19-21, Valletta, Malta, ISBN 2-9517408-6-7, ELRA, 5/2010.
