Hi William,

in its current state, hunspell has some limitations which make this solution imperfect. First, in some languages only all flags of the base form of the word determine the part of speech information (two genders can share the same affixes, for example). Hunspell is unable to distinguish that at the moment. Second, there is no support for stemming and hunstem in OOo (yet).

Nemeth Laszlo is planning to remove (1) but due to quite lengthy process of adding any feature to OOo it's hard to say when (2) will be a thing of the past. So all you can do is to write a hunspell dictionary parser, and implement the functions in Perl, or try JMorph (Java hunspell stemming port, available somewhere on Laci's university website, dig in the archives of his group for the link). You are using Java interface anyway in CoGrOO, so that should be a good temporary solution. Note that this Java interface needs some fixing (crashes on some dictionaries) but is almost a complete port.

Best,
Marcin

William Silva napisał(a):
Hi,

Have anyone tried to use Hunspell for morphological analysis? In our grammar
checker development (CoGrOO), we are using a morphological dictionary we
wrote. But it is big, especially because we didn't care about redundant
data, this would be solved using the Hunspell affixes, and even better, the
grammar checker and spell checker would share the same dictionary.
Does anyone know if the OOo make any interface available to its Hunspell, so
a grammar checker could use this interface to query the dictionaries?

Thanks!

William


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to