Hi,

Do you have that backwards? According to my dictionary the correct hyphenation for guardian is guard-i-an?

Perhaps you simply swapped the OOo <-> TeX columns?

Kevin

On Oct 31, 2005, at 6:08 PM, Daniel Naber wrote:

Hi,

the hyphenation code in OOo is supposed to work like the one in TeX.... but it doesn't. I did an evaluation of large word lists and the result is that of about 122,000 words in English, 27,000 have a different hyphenation in
OOo than in TeX. Some examples:

OOo <-> TeX
abor-ted   aborted
guard-i-an    guardian
briskness   brisk-ness

The same problem occurs for German words. In most (all?) cases the TeX
hyphenation seems to be correct, the one by OOo seems to be incorrect.

Luckily Walter Wojcik is working on a new version of the hyphenation code. For the record I'll now explain how this automatic evaluation works so it
can be done again when the new code is ready:

First, build the word lists using hunpsell's unmunch.

Then get the OOo hyphenator: download ALTLinux_hyph.zip from this page:
http://lingucomponent.openoffice.org/hyphenator.html
Compile it and call:
./example hyph_en_US.dic wordlist

Get the TeX hyphenation: Build a file "hpyhen.tex" with contents like this,
listing all words:

\documentclass{minimal}
\usepackage[latin1]{inputenc}
\usepackage[T1]{fontenc}
\showhyphens{A}
\showhyphens{abalone}
\showhyphens{abalones}
(...all other words here...)
\makeatletter\@@end

Then call "latex hyphen.tex". The log file then contains the hyphenated words, plus some debugging output. Remove that debugging output and you
can then compare both files with diff.

Regards
 Daniel

--
http://www.danielnaber.de

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: dev- [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to