but it also consume less memory after loading. This LGPL dictionary library uses a FSA data structure that requires less memory than Hashtable to store 500k words, and also is fast enough during runtime.
On Wed, Jul 6, 2011 at 11:29 AM, [email protected] < [email protected]> wrote: > > > On Wed, Jul 6, 2011 at 11:18 AM, Jörn Kottmann <[email protected]> wrote: > >> On 7/6/11 4:10 PM, [email protected] wrote: >> >>> In my project I'll have to use my own dictionary implementation. I don't >>> like the OpenNLP default one because it takes longer to load. I use a >>> LGPL >>> library that creates really small binary dictionaries that doesn't need >>> to >>> be parse during initialization. >>> >> >> How much longer does our dictionary need to load? >> > > I don't have updated numbers. I was using it in 1.4 and it was few seconds > faster when I moved to the binary dictionaries. My Portuguese tag dictionary > has 500k entries. >
