To comment on the following update, log in, then open the issue:
http://www.openoffice.org/issues/show_bug.cgi?id=66939





------- Additional comments from nem...@openoffice.org Fri Feb 27 09:11:45 
+0000 2009 -------
You are welcome. I'm very glad of the success, too, especially because this was
an old and serious problem.

I have used the he_IL version 1.0 of the OOo extension "he-IL-dict.oxt". I have
checked the new dictionary with the following commands (tabulated input and the
option -1 need for the correct tokenization):

Check the base dictionary: 

$ cat he_IL.dic | sed 's#/.*$##' | awk  '{print$0"\t"}' | LC_ALL=C
~/hunspell-1.2.8/src/tools/hunspell -d new_he_IL -1 -l

Check samples:

$ unmunch he_IL.dic he_IL.aff >he_IL.all  # need 1.3 GB disk space
$ time sed -n '1~1000p' he_IL.all >sample # sample with 100000 words
$ cat sample | awk  '{print$0"\t"}' | LC_ALL=C time
~/hunspell-1.2.8/src/tools/hunspell -d new_he_IL -1 -l

You can add the doubleaffixcompress and affixcompress scripts to your Hspell
distribution (doubleaffixcompress will be a standard tool of the next Hunspell
release, too). Also you may need to extend the affix file to handle Niqqut by
the IGNORE or the new ICONV/OCONV features of Hunspell.

(A further optimization could be to use hzip Hunspell compressed format:
$ ~/hunspell-1.2.8/src/tools/hzip *
$ ls -lh new_he_IL_alias.*
-rw-r--r-- 1 laci laci 634K 2009-02-27 09:55 new_he_IL_alias.aff.hz
-rw-r--r-- 1 laci laci 116K 2009-02-27 09:55 new_he_IL_alias.dic.hz
Hunspell library searches the hzip compressed files, if the given .dic and .aff
files are missing:
$ rm *aff *dic
$ ~/hunspell-1.2.8/src/tools/hunspell -d new_he_IL_alias
$ ~/hunspell-1.2.8/src/tools/hunspell -d new_he_IL_alias
Hunspell 1.2.8
a
& a 7 0: ו, ה, כ, ש, ב, מ, ל
But likely OpenOffice.org 3.1 and Firefox 3.1 extension formats don't support
hzip installation.)
Regards, László

---------------------------------------------------------------------
Please do not reply to this automatically generated notification from
Issue Tracker. Please log onto the website and enter your comments.
http://qa.openoffice.org/issue_handling/project_issues.html#notification

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@framework.openoffice.org
For additional commands, e-mail: issues-h...@framework.openoffice.org


---------------------------------------------------------------------
To unsubscribe, e-mail: allbugs-unsubscr...@openoffice.org
For additional commands, e-mail: allbugs-h...@openoffice.org

Reply via email to