On Wed, Apr 10, 2013 at 4:22 PM, Jörn Kottmann <[email protected]> wrote:
>
> The current dictionary package has an API to read and serialize a dictionary
> from and to the
> XML format. That could be changed to some binary based format which could be
> much faster.
> But as far as I understand is the main issue we have is the representation
> of the dictionary in memory
> and not the serialization of it.

Having a lemmatization API like this would be very nice.

Using JWNL just for a lemma lookup was quick but I actually think is
overkill. I have created now a "wordform pos lemma" dictionary from
WordNet and will perform hashmap lookups on it. If this could be done
using an adapted Dictionary API for a more efficient memory and speed
performance, that will be great.

Cheers,

Rodrigo

Reply via email to