Hi all,

Because of a patch to take care of I need to have a basic understanding
about the meaning of entries in a hyphenation dictionary.

If I look e.g. into the en-US hyphenation dictionary there are entries like

.e2a2r
.u4n5k2
a4c2a2r
am2i4no
4and
an5e2st.

What is the meaning of those?
First I thought each entry to be a word part (sub string) where the
numbers denote possible hyphenation points and the value the quality of
that hyphenation point.
But that seems not to be true. At least I do not know a word with a sub
string of 'ear' that can be hyphenated after each of those characters.
Similar for 'acar'. And what is the meaning of the '.' characters?

Can someone shed some light into this?


Note: The actual problem is with some Indic script where I need to check
if entries like
1ઐ1
ल2्2
will get processed correctly by the hyphenator. (But I thought a western
example might be more readable for most subscribers.)
Here the specific problem arises from the characters not being
represented by single bytes...


Thomas


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lingucomponent.openoffice.org
For additional commands, e-mail: dev-h...@lingucomponent.openoffice.org

Reply via email to