Hi,
Do you have that backwards? According to my dictionary the correct
hyphenation for guardian is guard-i-an?
Perhaps you simply swapped the OOo <-> TeX columns?
Kevin
On Oct 31, 2005, at 6:08 PM, Daniel Naber wrote:
Hi,
the hyphenation code in OOo is supposed to work like the one in
TeX.... but
it doesn't. I did an evaluation of large word lists and the result
is that
of about 122,000 words in English, 27,000 have a different
hyphenation in
OOo than in TeX. Some examples:
OOo <-> TeX
abor-ted aborted
guard-i-an guardian
briskness brisk-ness
The same problem occurs for German words. In most (all?) cases the TeX
hyphenation seems to be correct, the one by OOo seems to be incorrect.
Luckily Walter Wojcik is working on a new version of the
hyphenation code.
For the record I'll now explain how this automatic evaluation works
so it
can be done again when the new code is ready:
First, build the word lists using hunpsell's unmunch.
Then get the OOo hyphenator: download ALTLinux_hyph.zip from this
page:
http://lingucomponent.openoffice.org/hyphenator.html
Compile it and call:
./example hyph_en_US.dic wordlist
Get the TeX hyphenation: Build a file "hpyhen.tex" with contents
like this,
listing all words:
\documentclass{minimal}
\usepackage[latin1]{inputenc}
\usepackage[T1]{fontenc}
\showhyphens{A}
\showhyphens{abalone}
\showhyphens{abalones}
(...all other words here...)
\makeatletter\@@end
Then call "latex hyphen.tex". The log file then contains the
hyphenated
words, plus some debugging output. Remove that debugging output and
you
can then compare both files with diff.
Regards
Daniel
--
http://www.danielnaber.de
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: dev-
[EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]