Hi,

See extended ALetter definitions of the Hungarian word breaking rules:

http://svn.services.openoffice.org/ooo/branches/OOO310/i18npool/source/breakiterator/data/dict_word_hu.txt
http://svn.services.openoffice.org/ooo/branches/OOO310/i18npool/source/breakiterator/data/edit_word_hu.txt

By the way, it also contains numbers and other special signs, because
Hungarian uses their affixed forms. (For example, "with 25%" is
"25%-kal" in Hungarian, and not the frequent bad form "25%-al"):

$ALetter   = [\u0002 [:Alphabetic:] [:name= COMMERCIAL AT:] [:name=
HEBREW PUNCTUATION GERESH:]
                [:name = PERCENT SIGN:] [:name = PER MILLE SIGN:]
[:name = PER TEN THOUSAND SIGN:]
                [:name = SECTION SIGN:] [:name = DEGREE SIGN:] [:name
= EURO SIGN:]
                [:name = HYPHEN-MINUS:] [:name = EN DASH:] [:name = EM DASH:]
                [:name = DIGIT ZERO:]
                [:name = DIGIT ONE:]
                [:name = DIGIT TWO:]
                [:name = DIGIT THREE:]
                [:name = DIGIT FOUR:]
                [:name = DIGIT FIVE:]
                [:name = DIGIT SIX:]
                [:name = DIGIT SEVEN:]
                [:name = DIGIT EIGHT:]
                [:name = DIGIT NINE:]
                           - $Ideographic
                           - $Katakana
                           - $Hangul
                           - [:Script = Thai:]
                           - [:Script = Lao:]
                           - [:Script = Hiragana:]];

Best regards,
László


2009/6/10 Thomas Lange - Sun Germany - ham02 - Hamburg <thomas.la...@sun.com>:
>
> Hi all,
>
> Does anyone know how to modify the ICU (i.e. probably the word.txt file)
> to allow for pre- and postfix "HYPHEN-MINUS" and "EN DASH" as part of
> the word (in order to get them passed on to the spell checker as well)?
>
> This would be useful e.g. for German where there are correct word parts
> like
>  "Arbeits- und Verwaltungsrecht"
>
> Regards,
> Thomas
>
>
> BTW: Is there any other language where hyphens/dashes should be handled
> similarly?
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lingucomponent.openoffice.org
> For additional commands, e-mail: dev-h...@lingucomponent.openoffice.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lingucomponent.openoffice.org
For additional commands, e-mail: dev-h...@lingucomponent.openoffice.org

Reply via email to