On Wed, Apr 10, 2013 at 4:22 PM, Jörn Kottmann <[email protected]> wrote: > > The current dictionary package has an API to read and serialize a dictionary > from and to the > XML format. That could be changed to some binary based format which could be > much faster. > But as far as I understand is the main issue we have is the representation > of the dictionary in memory > and not the serialization of it.
Having a lemmatization API like this would be very nice. Using JWNL just for a lemma lookup was quick but I actually think is overkill. I have created now a "wordform pos lemma" dictionary from WordNet and will perform hashmap lookups on it. If this could be done using an adapted Dictionary API for a more efficient memory and speed performance, that will be great. Cheers, Rodrigo
