[mary-users] Mary-users Digest, Vol 51, Issue 9

Thorsten Westermann thorstenwestermann at gmx.net
Wed Sep 15 09:41:01 CEST 2010


I have spent a lot of time to make sure that this really is a bug, and it really is.

I guess the problem occurs when the next sentence starts with a "?", e. g. "? I"

Does anybody know if there is a quick workaround or if the problem has to be corrected by the Mary developers first, and then we need to re-create our voices to create a correct timeline?

The start times of the halfphones however are correct, that's why the bug didn't become obvious I guess.

MfG,
Westermann



-------- Original-Nachricht --------
> Datum: Tue, 14 Sep 2010 12:00:04 +0200
> Von: mary-users-request at dfki.de
> An: mary-users at dfki.de
> Betreff: Mary-users Digest, Vol 51, Issue 9

> Send Mary-users mailing list submissions to
> 	mary-users at dfki.de
> 
> To subscribe or unsubscribe via the World Wide Web, visit
> 	http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users
> or, via email, send a message with subject or body 'help' to
> 	mary-users-request at dfki.de
> 
> You can reach the person managing the list at
> 	mary-users-owner at dfki.de
> 
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Mary-users digest..."
> 
> 
> Today's Topics:
> 
>    1. Bug with commas? (Thorsten Westermann)
>    2. Bug with commas? (Thorsten Westermann)
>    3. Re: Bug with commas? (Ingmar Steiner)
>    4. Re: Bug with commas? (Thorsten Westermann)
> 
> 
> ----------------------------------------------------------------------
> 
> Message: 1
> Date: Mon, 13 Sep 2010 16:58:45 +0200
> From: "Thorsten Westermann" <thorstenwestermann at gmx.net>
> Subject: [mary-users] Bug with commas?
> To: mary-users at dfki.de, mary-users at dfki.de
> Message-ID: <20100913145845.142040 at gmx.net>
> Content-Type: text/plain; charset="utf-8"
> 
> Hello!
>  
> can anyone please confirm (the bug) that if you place a comma in a
> sentence in the txt.done.data file, only the first half of the sentence is used in
> the voice creation process for a diphone voice? I am talking about version
> 4.0.0. 
>  
> For example:
> When you have the following sentence in your txt.done.data file:
> 
> "Mein Name ist Thorsten, das wollte ich mal gesagt haben."
>  
> ... then it will only process the "Mein Name ist Thorsten".
>  
> The "das wollte ich doch mal gesagt haben" will not occur in the
> halfphones later on when I try to synthesize a text, that's why I guess that there
> is a bug in the voice creation part.
>  
> As a workaround, can I remove the comma, or will it affect anything?
> I have not yet understood if commas influence the byte features of a
> halfphone. Can anyone tell me?
> 
> Thank you for your reply.
> 
> MfG,
> Westermann
> 
> -- 
> GRATIS: Spider-Man 1-3 sowie 300 weitere Videos!
> Jetzt freischalten! http://portal.gmx.net/de/go/maxdome
> 
> 
> ------------------------------
> 
> Message: 2
> Date: Mon, 13 Sep 2010 16:58:45 +0200
> From: "Thorsten Westermann" <thorstenwestermann at gmx.net>
> Subject: [mary-users] Bug with commas?
> To: mary-users at dfki.de, mary-users at dfki.de
> Message-ID: <20100913145845.142040 at gmx.net>
> Content-Type: text/plain; charset="utf-8"
> 
> Hello!
>  
> can anyone please confirm (the bug) that if you place a comma in a
> sentence in the txt.done.data file, only the first half of the sentence is used in
> the voice creation process for a diphone voice? I am talking about version
> 4.0.0. 
>  
> For example:
> When you have the following sentence in your txt.done.data file:
> 
> "Mein Name ist Thorsten, das wollte ich mal gesagt haben."
>  
> ... then it will only process the "Mein Name ist Thorsten".
>  
> The "das wollte ich doch mal gesagt haben" will not occur in the
> halfphones later on when I try to synthesize a text, that's why I guess that there
> is a bug in the voice creation part.
>  
> As a workaround, can I remove the comma, or will it affect anything?
> I have not yet understood if commas influence the byte features of a
> halfphone. Can anyone tell me?
> 
> Thank you for your reply.
> 
> MfG,
> Westermann
> 
> -- 
> GRATIS: Spider-Man 1-3 sowie 300 weitere Videos!
> Jetzt freischalten! http://portal.gmx.net/de/go/maxdome
> 
> 
> ------------------------------
> 
> Message: 3
> Date: Mon, 13 Sep 2010 17:32:08 +0200
> From: Ingmar Steiner <ingmar.steiner at dfki.de>
> Subject: Re: [mary-users] Bug with commas?
> To: "Thorsten Westermann" <thorstenwestermann at gmx.net>
> Cc: mary-users at dfki.de
> Message-ID: <DF46135E-744A-482D-8BB6-9AC1C5A7CD5F at dfki.de>
> Content-Type: text/plain; charset=utf-8
> 
> Dear Thorsten,
> 
> first of all, could you please confirm that this issue persists with the
> current Mary release 4.1.1?
> 
> If so, please provide details on which voice import component first
> truncates your utterance, i.e. at which point in the voicebuilding process is the
> second clause dropped?
> 
> In addition, please make sure that you do not have an XML file in
> BASEDIR/rawmaryxml that could override the text.
> 
> P.S. Are you really creating a diphone voice, or rather a unit selection
> voice? =)
> 
> Best wishes,
> 
> /**
>  * Ingmar Steiner
>  * Researcher, Language Technology
>  * German Research Center for Artificial Intelligence
>  *
>  * Campus D3 1 +1.18
>  * D-66123 Saarbr?cken
>  * Germany
>  * Phone: ++49-681-857-75-5263 (NEW!)
>  * Email: ingmar.steiner at dfki.de
>  *
>  * Deutsches Forschungszentrum f?r K?nstliche Intelligenz GmbH
>  * Trippstadter Stra?e 122, D-67663 Kaiserslautern, Germany
>  * Gesch?ftsf?hrung:
>  * Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
>  * Dr. Walter Olthoff
>  * Vorsitzender des Aufsichtsrats:
>  * Prof. Dr. h.c. Hans A. Aukes
>  * Amtsgericht Kaiserslautern, HRB 2313
>  */
> 
> On 13 Sep 2010, at 16:58, Thorsten Westermann wrote:
> 
> > Hello!
> > 
> > can anyone please confirm (the bug) that if you place a comma in a
> sentence in the txt.done.data file, only the first half of the sentence is used
> in the voice creation process for a diphone voice? I am talking about
> version 4.0.0. 
> > 
> > For example:
> > When you have the following sentence in your txt.done.data file:
> > 
> > "Mein Name ist Thorsten, das wollte ich mal gesagt haben."
> > 
> > ... then it will only process the "Mein Name ist Thorsten".
> > 
> > The "das wollte ich doch mal gesagt haben" will not occur in the
> halfphones later on when I try to synthesize a text, that's why I guess that there
> is a bug in the voice creation part.
> > 
> > As a workaround, can I remove the comma, or will it affect anything?
> > I have not yet understood if commas influence the byte features of a
> halfphone. Can anyone tell me?
> > 
> > Thank you for your reply.
> > 
> > MfG,
> > Westermann
> > 
> > -- 
> > GRATIS: Spider-Man 1-3 sowie 300 weitere Videos!
> > Jetzt freischalten! http://portal.gmx.net/de/go/maxdome
> > _______________________________________________
> > Mary-users mailing list
> > Mary-users at dfki.de
> > http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users
> 
> 
> 
> ------------------------------
> 
> Message: 4
> Date: Tue, 14 Sep 2010 10:08:48 +0200
> From: "Thorsten Westermann" <thorstenwestermann at gmx.net>
> Subject: Re: [mary-users] Bug with commas?
> To: Ingmar Steiner <ingmar.steiner at dfki.de>
> Cc: mary-users at dfki.de
> Message-ID: <20100914080848.285320 at gmx.net>
> Content-Type: text/plain; charset="utf-8"
> 
> I am sorry, it was a bug in my brain, not in Mary.
> 
> I was fighting with extracting some half(phones) because I tried to
> understand more why we use datagrams instead of simple positions, e. g.
> "ByteStart" and "Duration". 
> 
> I tried to have extract the bytes from the selected units and output them
> as individual files, but there is some kind of problem which I have not
> managed yet.
> 
> GetFilenameAndTime tells me the byte position and the milliseconds of the
> used file, and the selected unit tells me which duration in bytelength it
> has. But when I output it (using the original file), I get cracks and noise.
> I am not sure yet why... 
> 
> MfG,
> Westermann
> 
> -------- Original-Nachricht --------
> > Datum: Mon, 13 Sep 2010 17:32:08 +0200
> > Von: Ingmar Steiner <ingmar.steiner at dfki.de>
> > An: "Thorsten Westermann" <thorstenwestermann at gmx.net>
> > CC: mary-users at dfki.de
> > Betreff: Re: [mary-users] Bug with commas?
> 
> > Dear Thorsten,
> > 
> > first of all, could you please confirm that this issue persists with the
> > current Mary release 4.1.1?
> > 
> > If so, please provide details on which voice import component first
> > truncates your utterance, i.e. at which point in the voicebuilding
> process is the
> > second clause dropped?
> > 
> > In addition, please make sure that you do not have an XML file in
> > BASEDIR/rawmaryxml that could override the text.
> > 
> > P.S. Are you really creating a diphone voice, or rather a unit selection
> > voice? =)
> > 
> > Best wishes,
> > 
> > /**
> >  * Ingmar Steiner
> >  * Researcher, Language Technology
> >  * German Research Center for Artificial Intelligence
> >  *
> >  * Campus D3 1 +1.18
> >  * D-66123 Saarbr?cken
> >  * Germany
> >  * Phone: ++49-681-857-75-5263 (NEW!)
> >  * Email: ingmar.steiner at dfki.de
> >  *
> >  * Deutsches Forschungszentrum f?r K?nstliche Intelligenz GmbH
> >  * Trippstadter Stra?e 122, D-67663 Kaiserslautern, Germany
> >  * Gesch?ftsf?hrung:
> >  * Prof. Dr. Dr. h.c. mult. Wolfgang Wahlster (Vorsitzender)
> >  * Dr. Walter Olthoff
> >  * Vorsitzender des Aufsichtsrats:
> >  * Prof. Dr. h.c. Hans A. Aukes
> >  * Amtsgericht Kaiserslautern, HRB 2313
> >  */
> > 
> > On 13 Sep 2010, at 16:58, Thorsten Westermann wrote:
> > 
> > > Hello!
> > > 
> > > can anyone please confirm (the bug) that if you place a comma in a
> > sentence in the txt.done.data file, only the first half of the sentence
> is used
> > in the voice creation process for a diphone voice? I am talking about
> > version 4.0.0. 
> > > 
> > > For example:
> > > When you have the following sentence in your txt.done.data file:
> > > 
> > > "Mein Name ist Thorsten, das wollte ich mal gesagt haben."
> > > 
> > > ... then it will only process the "Mein Name ist Thorsten".
> > > 
> > > The "das wollte ich doch mal gesagt haben" will not occur in the
> > halfphones later on when I try to synthesize a text, that's why I guess
> that there
> > is a bug in the voice creation part.
> > > 
> > > As a workaround, can I remove the comma, or will it affect anything?
> > > I have not yet understood if commas influence the byte features of a
> > halfphone. Can anyone tell me?
> > > 
> > > Thank you for your reply.
> > > 
> > > MfG,
> > > Westermann
> > > 
> > > -- 
> > > GRATIS: Spider-Man 1-3 sowie 300 weitere Videos!
> > > Jetzt freischalten! http://portal.gmx.net/de/go/maxdome
> > > _______________________________________________
> > > Mary-users mailing list
> > > Mary-users at dfki.de
> > > http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users
> > 
> 
> -- 
> GRATIS: Spider-Man 1-3 sowie 300 weitere Videos!
> Jetzt freischalten! http://portal.gmx.net/de/go/maxdome
> 
> 
> ------------------------------
> 
> _______________________________________________
> Mary-users mailing list
> Mary-users at dfki.de
> http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users
> 
> 
> End of Mary-users Digest, Vol 51, Issue 9
> *****************************************

-- 
Neu: GMX De-Mail - Einfach wie E-Mail, sicher wie ein Brief!  
Jetzt De-Mail-Adresse reservieren: http://portal.gmx.net/de/go/demail


More information about the Mary-users mailing list