[mary-dev] WikipediaProcessor: Japanese Processing Exception

Hind Abdul-Khaleq habdolkhaleq at yahoo.com
Sun Nov 8 05:41:56 CET 2009


The problem was solved by passing "-Xmx2000m" to the VM, without any other changes to the source.
Thanks a lot, and all the best.
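
For anyone reproducing this fix, here is a minimal sketch for checking that the heap flag actually reached the VM. It is not part of MARY TTS; the class name HeapCheck and its method are hypothetical and used only for illustration.

```java
public class HeapCheck {
    /** Returns the maximum heap the running JVM reports, in megabytes. */
    static long maxHeapMegabytes() {
        // Runtime.maxMemory() is the upper bound the JVM will try to use for the heap.
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        System.out.println("Max heap: " + maxHeapMegabytes() + " MB");
    }
}
```

Run it with the same flag as the failing tool, for example "java -Xmx2000m HeapCheck". The reported value is typically close to the requested size rather than exactly equal to it.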


--- On Wed, 10/28/09, Hind Abdul-Khaleq <habdolkhaleq at yahoo.com> wrote:

From: Hind Abdul-Khaleq <habdolkhaleq at yahoo.com>
Subject: Re: [mary-dev] WikipediaProcessor: Japanese Processing Exception
To: mary-dev at dfki.de
Date: Wednesday, October 28, 2009, 11:45 AM

I'm getting this exception while processing Japanese.
I changed the encoding to "EUC_JP" at the line

            word = new String(wordBytes, "UTF8");  
in  marytts.tools.dbselection.DBHandler.getMostFrequentWords(DBHandler.java:1366)

but it produced another exception at the next line:
         wordList.put(word, new Integer(rs.getInt(2)));
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
    at java.util.HashMap.resize(HashMap.java:462)
    at java.util.HashMap.addEntry(HashMap.java:755)
    at java.util.HashMap.put(HashMap.java:385)
    at marytts.tools.dbselection.DBHandler.getMostFrequentWords(DBHandler.java:1367)
    at marytts.tools.dbselection.WikipediaMarkupCleaner.updateWordList(WikipediaMarkupCleaner.java:953)
    at marytts.tools.dbselection.WikipediaMarkupCleaner.processWikipediaPages(WikipediaMarkupCleaner.java:1133)
    at marytts.tools.dbselection.WikipediaProcessor.main(WikipediaProcessor.java:368)

I am also already running with "-Xmx1000m",... so what should I do?
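
On the encoding change described above, a minimal sketch of what the charset argument does when bytes are turned into a String. It assumes the word bytes are UTF-8, which is what the original line implies but is not confirmed here; the class DecodeDemo, its method, and the sample word are hypothetical and not taken from DBHandler.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class DecodeDemo {
    /** Decodes raw bytes into a String using the named charset. */
    static String decode(byte[] wordBytes, String charsetName) {
        return new String(wordBytes, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // A Japanese sample word encoded as UTF-8, standing in for bytes read from the database.
        byte[] wordBytes = "\u65e5\u672c\u8a9e".getBytes(StandardCharsets.UTF_8);

        // Decoding with the charset the bytes were written in recovers the word.
        System.out.println(decode(wordBytes, "UTF-8"));

        // EUC-JP is an optional charset, so check for it before using it.
        if (Charset.isSupported("EUC-JP")) {
            // Decoding UTF-8 bytes as EUC-JP does not recover the original word.
            System.out.println(decode(wordBytes, "EUC-JP"));
        }
    }
}
```

If the stored bytes really are UTF-8, switching the decoder to "EUC_JP" changes how they are interpreted but does nothing to reduce memory use, which fits the heap-space error still appearing afterwards.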

--- On Wed, 10/28/09, Hind Abdul-Khaleq <habdolkhaleq at yahoo.com> wrote:

From: Hind Abdul-Khaleq <habdolkhaleq at yahoo.com>
Subject: [mary-dev] WikipediaProcessor: Japanese Exception
To: mary-dev at dfki.de
Date: Wednesday, October 28, 2009, 11:34 AM

Exception in thread "main" java.lang.OutOfMemoryError: GC overhead limit exceeded
    at java.util.Arrays.copyOf(Arrays.java:2882)
    at java.lang.StringCoding.safeTrim(StringCoding.java:75)
    at java.lang.StringCoding.access$100(StringCoding.java:34)
    at java.lang.StringCoding$StringDecoder.decode(StringCoding.java:151)
    at java.lang.StringCoding.decode(StringCoding.java:173)
    at java.lang.String.<init>(String.java:443)
    at java.lang.String.<init>(String.java:515)
    at marytts.tools.dbselection.DBHandler.getMostFrequentWords(DBHandler.java:1366)
    at marytts.tools.dbselection.WikipediaMarkupCleaner.updateWordList(WikipediaMarkupCleaner.java:953)
    at marytts.tools.dbselection.WikipediaMarkupCleaner.processWikipediaPages(WikipediaMarkupCleaner.java:1133)
    at marytts.tools.dbselection.WikipediaProcessor.main(WikipediaProcessor.java:368)






      
-----Inline Attachment Follows-----

_______________________________________________
Mary-dev mailing list
Mary-dev at dfki.de
http://www.dfki.de/mailman/cgi-bin/listinfo/mary-dev





      