<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    Dear Anderson, <br>

    <br>

    The hts_engine used during training, is the original provided by

    HTS, so it does not use the mixed excitation that we have

    implemented, therefore it sound quite different, I have left that

    step on the training procedure just to verify that it is completed

    successfully.<br>

    <br>

    In Mary we have a ported version of the HTS engine to java, which

    uses our own version of mixed excitation and of course MARY context

    features. On the configuration file of the voice you can modify some

    options to tune a bit your voice, so it does not sound so metallic

    or rough;<br>

    # Parameter beta for postfiltering&nbsp;

<br>

    voice.cmu-slt-hsmm.beta = 0.1&nbsp;&nbsp;

(the same effect described in the

    original hts_engine)<br>

    <br>

    &nbsp;you can also tune a bit the number of iterations for GV:<br>

    # Information about Global Mean and Variance PDFs

<br>

    voice.cmu-slt-hsmm.useGV = true

<br>

    voice.cmu-slt-hsmm.maxMgcGvIter = 200

<br>

    voice.cmu-slt-hsmm.maxLf0GvIter = 200<br>

    <br>

    Regarding the other problem, could you provide more information,

    which version of MARY are you using, in which OS, etc.<br>

    <br>

    Regards, <br>

    Marcela.<br>

    <br>

    <br>

    On 04/21/2012 06:19 PM, Anderson de Oliveira Monte wrote:

    <blockquote

cite="mid:CADyZqaV-DdsziYot_7PFY3BeD5j_tnLvznRi_VQuVUhu4G7LRA@mail.gmail.com"

      type="cite">Hello Marcela,

      <div><br>

      </div>

      <div>Thanks for the settings. Now I have another problem to solve.</div>

      <div><br>

      </div>

      <div>The voice that MARY synthesizes sounds rougher than the one

        synthesized by the hts-engine during the HTS script training.</div>

      <div><br>

      </div>

      <div>In addition to that, an exception occurrs when I try to save

        the audio of this voice in .wav and .aiff files, it's only

        possible to save it in au. files.</div>

      <div><br>

      </div>

      <div>In the attachments you will find:</div>

      <div><br>

      </div>

      <div>- Examples of the voice synthesized by hts-engine

        (gen_voz0001.wav) and by MARY TTS (MARY_voz0001.au);</div>

      <div>- The dialog box of the exception;</div>

      <div>- A text file containing the error message of the exception.</div>

      <div><br>

      </div>

      <div>Is there any configuration setting that I'm still missing?</div>

      <div><br>

      </div>

      <div>Regards,</div>

      <div><br>

      </div>

      <div>Anderson<br>

        <br>

        <div class="gmail_quote">2012/4/16 Marcela Charfuelan <span

            dir="ltr">&lt;<a moz-do-not-send="true"

              href="mailto:Marcela.Charfuelan@dfki.de" target="_blank">Marcela.Charfuelan@dfki.de</a>&gt;</span><br>

          <blockquote class="gmail_quote" style="margin:0 0 0

            .8ex;border-left:1px #ccc solid;padding-left:1ex">

            <div bgcolor="#ffffff" text="#000000"> Dear Anderson de

              Oliveira Monte,<br>

              <br>

              The default settings are for 16Kz,<br>

              HMMVoiceConfigure.fftLen 512<br>

              HMMVoiceConfigure.frameLen 400&nbsp; (0.025 sec)<br>

              HMMVoiceConfigure.frameShift 80&nbsp;&nbsp; (0.005 sec)<br>

              HMMVoiceConfigure.freqWarp 0.42<br>

              <br>

              so if you want to train with 22050, please take into

              account the following settings:<br>

              <br>

              HMMVoiceConfigure.fftLen 512<br>

              HMMVoiceConfigure.frameLen ~ 550&nbsp; (0.025*22050)<br>

              HMMVoiceConfigure.frameShift ~ 110&nbsp; (0.005*22050)<br>

              HMMVoiceConfigure.freqWarp&nbsp; 0.45<br>

              <br>

              Frequency warping factor:<br>

              8000 FREQWARP=0.31<br>

              10000 FREQWARP=0.35<br>

              12000 FREQWARP=0.37<br>

              16000 FREQWARP=0.42<br>

              22050 FREQWARP=0.45<br>

              32000 FREQWARP=0.45<br>

              44100 FREQWARP=0.53<br>

              48000 FREQWARP=0.55<br>

              <br>

              Regards,<br>

              Marcela.

              <div>

                <div><br>

                  <br>

                  On 04/13/2012 05:52 AM, Anderson de Oliveira Monte

                  wrote: </div>

              </div>

              <blockquote type="cite">

                <div>

                  <div>

                    <div>Hello,</div>

                    <div><br>

                    </div>

                    <div>I was able to build and install in MARY TTS a

                      Brazilian Portuguese HMM Voice using wav files

                      that were downsampled to 16000Hz.</div>

                    <div><br>

                    </div>

                    <div>Now I'm trying to build another voice with the

                      same wav files in their original sampling rate

                      (22050Hz). I've made two attempts, each one with

                      the following sets of external programs:</div>

                    <div><br>

                    </div>

                    <div><b><u>RECOMMENDED SET (by MARY's HMM-based

                          voice building tutorial):</u></b></div>

                    <div><b>HTS-2.1</b></div>

                    <div><b>HTK-3.4</b></div>

                    <div><b>HDecode-3.4</b></div>

                    <div><b>SPTK-3.2&nbsp;</b></div>

                    <div><b>hts_engine_API-1.01&nbsp;</b></div>

                    <div><b>========================================================================</b></div>

                    <div><b><u>ALTERNATIVE SET:</u></b></div>

                    <div><b>HTS-2.1.1</b></div>

                    <div><b>HTK-3.4.1</b></div>

                    <div><b>HDecode-3.4.1</b></div>

                    <div> <b>SPTK-3.3&nbsp;</b></div>

                    <div><b>hts_engine_API-1.03</b></div>

                    <div><br>

                    </div>

                    <div>With the RECOMMENDED set, I was able to build

                      and install the voice in MARY without erros.

                      However, it's impossible to understand what it

                      says (I believe it has something to do with the

                      alignment step, although no error was reported

                      during the whole voice building process). With the

                      ALTERNATIVE set, I was able to build a nice voice,

                      better than the one in 16000Hz (I checked it at

                      the directories under

                      "hts/gen/qst001/ver1/1mix/"), but errors ocurred

                      when the training process tried to synthesize

                      waveforms using the hts-engine (not only in

                      version 1.03, but also in versions 1.01 and 1.06).

                      In addition to that, I tried to install and use

                      this voice in MARY TTS, but errors also occurred

                      when the MARY SERVER tried to load it.</div>

                    <div><br>

                    </div>

                    <div>I would like to know if anybody has already

                      built a HMM voice in 22050Hz in MARY. If yes, how

                      did you solve such issues and what versions of the

                      mentioned programs did you use? Is there any

                      version of MARY TTS with improvements or bugfixes

                      to solve these issues?</div>

                    <div><br>

                    </div>

                    <div>All the best,</div>

                    <div><br>

                    </div>

                    <div>Anderson</div>

                  </div>

                </div>

                <pre><fieldset></fieldset>

_______________________________________________

Mary-users mailing list

<a moz-do-not-send="true" href="mailto:Mary-users@dfki.de" target="_blank">Mary-users@dfki.de</a>

<a moz-do-not-send="true" href="http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users" target="_blank">http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users</a>

</pre>

              </blockquote>

              <br>

              <br>

              <pre cols="72">-- 

_______________________________________________________________

 Marcela Charfuelan, Researcher, DFKI GmbH

 Projektbuero Berlin, Alt-Moabit 91c, D-10559 Berlin, Germany

 Phone: <a moz-do-not-send="true" href="tel:%2B49%20%280%2930%2023895-1821" value="+4930238951821" target="_blank">+49 (0)30 23895-1821</a>

 URL  : <a moz-do-not-send="true" href="http://www.dfki.de/%7Echarfuel/" target="_blank">http://www.dfki.de/~charfuel/</a>

_______________________________________________________________

 Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH

 Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern

 Geschaeftsfuehrung:

 Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)

 Dr. Walter Olthoff 

 Vorsitzender des Aufsichtsrats:

 Prof. Dr. h.c. Hans A. Aukes

 Amtsgericht Kaiserslautern, HRB 2313

_______________________________________________________________

</pre>

            </div>

          </blockquote>

        </div>

        <br>

      </div>

    </blockquote>

    <br>

    <br>

    <pre class="moz-signature" cols="72">-- 

_______________________________________________________________

 Marcela Charfuelan, Researcher, DFKI GmbH

 Projektbuero Berlin, Alt-Moabit 91c, D-10559 Berlin, Germany

 Phone: +49 (0)30 23895-1821

 URL  : <a class="moz-txt-link-freetext" href="http://www.dfki.de/~charfuel/">http://www.dfki.de/~charfuel/</a>

_______________________________________________________________

 Deutsches Forschungszentrum fuer Kuenstliche Intelligenz GmbH

 Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern

 Geschaeftsfuehrung:

 Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)

 Dr. Walter Olthoff 

 Vorsitzender des Aufsichtsrats:

 Prof. Dr. h.c. Hans A. Aukes

 Amtsgericht Kaiserslautern, HRB 2313

_______________________________________________________________

</pre>

  </body>

</html>