Hi Laurent,

I have successfully checked all official MySpell dictionaries with Hunspell.

There were only a few problems, with MySpell's unmunch tool and with MySpell
3.0. Unmunch doesn't check the stripping characters, and generates bad words
from affix rules with weak or bad conditions. (Affix rules with bad conditions
have incompatible conditions and stripping characters.) MySpell 3.0 has an
obsolete cleanword function, so some dictionary words are already wrong before
the check:

wget ftp.services.openoffice.org/pub/OpenOffice.org/contrib/dictionaries/en_GB.zip
unzip -d en_GB en_GB.zip
cd en_GB
~/MySpell-3.0/example en_GB.aff en_GB.dic \
  <(unmunch en_GB.dic en_GB.aff 2>/dev/null) | grep 'incorrect!'
"'twixt" is incorrect!
"'mongst" is incorrect!
"'strewth" is incorrect!
"'twas" is incorrect!
"'struth" is incorrect!
"'neath" is incorrect!

For English this is not noticeable in OpenOffice.org, because OpenOffice.org's
default "en_GB-oed" and "en_US" dictionaries contain the incorrect forms
"twixt", "mongst", etc., without the apostrophe. (Perhaps intentionally,
because the apostrophe is a wrong but frequent(?) left quotation mark in
English.)
But there are several other dictionaries with "lost" words in OOo.
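
To illustrate the unmunch problem described above, here is a minimal Python
sketch of the check that is missing. It assumes a simplified subset of the
affix condition syntax (literal characters, ".", "[abc]" and "[^abc]") and
only handles suffix rules. The function names are hypothetical and are not
taken from the MySpell or Hunspell sources.

```python
def parse_condition(cond):
    """Split a condition string into slots of the form (negated, chars)."""
    slots = []
    i = 0
    while i < len(cond):
        ch = cond[i]
        if ch == "[":
            end = cond.index("]", i)
            body = cond[i + 1:end]
            negated = body.startswith("^")
            slots.append((negated, set(body[1:] if negated else body)))
            i = end + 1
        elif ch == ".":
            # "not in the empty set" accepts every character
            slots.append((True, set()))
            i += 1
        else:
            slots.append((False, {ch}))
            i += 1
    return slots


def slot_accepts(slot, ch):
    negated, chars = slot
    return (ch not in chars) if negated else (ch in chars)


def suffix_rule_is_consistent(strip, cond):
    """True if a word ending in `strip` can still satisfy `cond`."""
    strip = "" if strip == "0" else strip  # "0" means "strip nothing"
    slots = parse_condition(cond)
    # Compare from the end of the word backwards.
    for slot, ch in zip(reversed(slots), reversed(strip)):
        if not slot_accepts(slot, ch):
            return False
    return True


def apply_suffix(word, strip, add, cond):
    """Apply a suffix rule, or return None if the rule does not fit."""
    strip = "" if strip == "0" else strip
    add = "" if add == "0" else add
    slots = parse_condition(cond)
    if len(word) < len(slots) or not word.endswith(strip):
        return None
    tail = word[len(word) - len(slots):]
    if not all(slot_accepts(s, c) for s, c in zip(slots, tail)):
        return None
    return word[:len(word) - len(strip)] + add
```

A generator that skips the word.endswith(strip) test would strip characters
from words that do not end in them, which is one way bad forms can appear.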

I think Hunspell handles all official OOo spell-checking dictionaries
correctly. But I will fix unmunch and retest Hunspell with OOo's dictionaries.
I will send the results of the test together with the testing shell script.

I will set up strict quality control for Hunspell, with similar tests before
every Hunspell release.
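
As a rough idea of how such a test could be evaluated automatically, the
following Python sketch collects the rejected words from checker output in
the `"word" is incorrect!` format shown earlier. It is only an illustration
under that assumption; the function names are hypothetical and this is not
the testing shell script mentioned above.

```python
import re

# Matches lines such as: "'twixt" is incorrect!
INCORRECT_LINE = re.compile(r'^"(.*)" is incorrect!$')


def rejected_words(checker_output):
    """Return the words reported as incorrect, in order of appearance."""
    words = []
    for line in checker_output.splitlines():
        match = INCORRECT_LINE.match(line.strip())
        if match:
            words.append(match.group(1))
    return words


def dictionary_passes(checker_output):
    """A dictionary passes if none of its own words is rejected."""
    return not rejected_words(checker_output)
```

Feeding it the en_GB output from above would give a list of the six rejected
forms, so the release check fails until they are accepted.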

You are right, OOo 2.0.0 is not the right target for Hunspell.
The right target is a beta version before 2.0.0.
Unfortunately, these items were not displayed in OOo Issuezilla.
We need an early release for Hunspell (as for any free software). I think OOo
2.0.1 will be only a patched (but significant) version of OOo 2.0.0.
Previous versions of Hunspell have been tested in the official Hungarian
OpenOffice.org (http://hu.openoffice.org) for a long time.

Thanks for your comments.

Best regards

Laci



Quoting Laurent Godard <[EMAIL PROTECTED]>:

> Hi Nemeth, hi Kevin
>
> First thanks for such an extension of spellchecking features.
>
> Nevertheless, we wonder if 2.0.0 is the correct target version?
> http://www.openoffice.org/issues/show_bug.cgi?id=52383
>
> Is it ready? Can it create additional feedback regarding spellcheckers?
>
> As Hunspell is designed as an addon and there is a need for intensive
> and systematic verification of all existing dictionaries, I would
> propose this:
>
> - deliver it as an addon, officially supported by the lingucomponent
> project or the coming scripting one.
> Then the native-lang projects needing the new features can use it.
> An installer, such as DicOOo, can be made for it if really needed, but
> addon installation is now really easy under OOo2
>
> - in the meantime, organize a campaign of testing the existing
> dictionaries. This could be launched by contacting Native-Lang projects,
> asking them to set this up inside their communities
>
> - target the official Hunspell replacement of MySpell to 2.0.1
>
> Any remarks are welcome, and of course I can help set up the testing campaign
>
> Thanks
>
> Laurent
>
> --
> Laurent Godard <[EMAIL PROTECTED]> - Ingénierie OpenOffice.org
> Indesko >> http://www.indesko.com
> Nuxeo CPS >> http://www.nuxeo.com - http://www.cps-project.org
> Livre "Programmation OpenOffice.org", Eyrolles 2004
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>



