Publikation

Automatic Recognition of Speakers Age and Gender on the Basis of Empirical Studies

Christian Müller

In: Proceedings of the 9th International Conference on Spoken Language Processing. Conference in the Annual Series of Interspeech Events (INTERSPEECH-06) September 17-21 Pittsburg PA United States ISCA 2006.

Abstrakt

This paper describes a system that exploits the paralinguistic information in the speech to estimate the speakers' age and gender. Compared with previously published work, the so called AGENDER approach involves finer grained speaker classes and achieves a significantly higher classification accuracy. The introduction encompasses various application examples representing the actual AGENDER project context. Then hypotheses, method and a representative selection of results from extensive corpus analyses are presented, that build the empirical basis for the machine learning. Finally, the AGENDER approach on speaker classification is outlined, involving the comparison of different classification methods as well as evaluation results. The paper finishes with an outlook on extensions that are scheduled for the next project phase.

mueller2006.pdf (pdf, 108 KB )

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence