Hi,

See the extended $ALetter definition in the Hungarian word-breaking rules:
http://svn.services.openoffice.org/ooo/branches/OOO310/i18npool/source/breakiterator/data/dict_word_hu.txt
http://svn.services.openoffice.org/ooo/branches/OOO310/i18npool/source/breakiterator/data/edit_word_hu.txt

By the way, the definition also includes digits and other special signs, because Hungarian attaches suffixes to them. (For example, "with 25%" is "25%-kal" in Hungarian, not the frequent incorrect form "25%-al"):

$ALetter = [\u0002 [:Alphabetic:] [:name= COMMERCIAL AT:]
    [:name= HEBREW PUNCTUATION GERESH:]
    [:name = PERCENT SIGN:] [:name = PER MILLE SIGN:]
    [:name = PER TEN THOUSAND SIGN:] [:name = SECTION SIGN:]
    [:name = DEGREE SIGN:] [:name = EURO SIGN:]
    [:name = HYPHEN-MINUS:] [:name = EN DASH:] [:name = EM DASH:]
    [:name = DIGIT ZERO:] [:name = DIGIT ONE:] [:name = DIGIT TWO:]
    [:name = DIGIT THREE:] [:name = DIGIT FOUR:] [:name = DIGIT FIVE:]
    [:name = DIGIT SIX:] [:name = DIGIT SEVEN:] [:name = DIGIT EIGHT:]
    [:name = DIGIT NINE:]
    - $Ideographic - $Katakana - $Hangul
    - [:Script = Thai:] - [:Script = Lao:] - [:Script = Hiragana:]];

Best regards,
László

2009/6/10 Thomas Lange - Sun Germany - ham02 - Hamburg <thomas.la...@sun.com>:
>
> Hi all,
>
> Does anyone know how to modify the ICU (i.e. probably the word.txt file)
> to allow for pre- and postfix "HYPHEN-MINUS" and "EN DASH" as part of
> the word (in order to get them passed on to the spell checker as well)?
>
> This would be useful e.g. for German where there are correct word parts
> like
> "Arbeits- und Verwaltungsrecht"
>
> Regards,
> Thomas
>
>
> BTW: Is there any other language where hyphens/dashes should be handled
> similarly?
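
To illustrate what treating these signs as letters means for tokenisation, here is a minimal Python sketch. It is only an approximation under stated assumptions, not ICU and not the actual rule file: it uses the standard `re` module, treats `\w` as a stand-in for [:Alphabetic:] plus the digits (so it also accepts "_" and non-ASCII digits), and ignores the script exclusions (Thai, Lao, Hiragana and so on). The names `EXTRA_NAMES` and `rough_words` are made up for this example.

```python
import re
import unicodedata

# Character names copied from the $ALetter definition quoted above.
EXTRA_NAMES = [
    "COMMERCIAL AT", "HEBREW PUNCTUATION GERESH", "PERCENT SIGN",
    "PER MILLE SIGN", "PER TEN THOUSAND SIGN", "SECTION SIGN",
    "DEGREE SIGN", "EURO SIGN", "HYPHEN-MINUS", "EN DASH", "EM DASH",
]

# Resolve each Unicode name to its character.
EXTRA_CHARS = "".join(unicodedata.lookup(name) for name in EXTRA_NAMES)

# Assumption: \w approximates [:Alphabetic:] plus digits. This is not
# identical to ICU's property sets and the script subtractions are skipped.
WORD = re.compile("[\\w" + re.escape(EXTRA_CHARS) + "]+")

def rough_words(text):
    """Return maximal runs of 'letter-like' characters, in order."""
    return WORD.findall(text)

print(rough_words("A 25%-kal"))                      # ['A', '25%-kal']
print(rough_words("Arbeits- und Verwaltungsrecht"))  # ['Arbeits-', 'und', 'Verwaltungsrecht']
```

With such a set, "25%-kal" reaches the spell checker as one word, and the trailing hyphen of "Arbeits-" stays attached. Real ICU word breaking involves more rules than the $ALetter set alone, so the actual behaviour may differ.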
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lingucomponent.openoffice.org
For additional commands, e-mail: dev-h...@lingucomponent.openoffice.org