On 7/6/11 4:38 PM, [email protected] wrote:
but it also consumes less memory after loading. This LGPL dictionary library uses an FSA data structure that requires less memory than a Hashtable to store 500k words, and is also fast enough at runtime.
Yeah, it would be nice to have a better dictionary in OpenNLP. We also discussed using Bloom filters, which I believe might be good enough for feature generation in many cases anyway.

Jörn
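
For illustration, here is a minimal sketch of what a Bloom-filter-backed dictionary lookup could look like. It is not OpenNLP code: the class name BloomDictionary, its methods, and the choice of a seeded FNV-1a hash with double hashing are all assumptions made for this example. The idea is that a lookup may return a false positive but never a false negative, which may be acceptable when the result only feeds a feature generator.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Hypothetical sketch of an approximate word dictionary (not an OpenNLP class).
class BloomDictionary {
    private final BitSet bits;
    private final int numBits;
    private final int numHashes;

    BloomDictionary(int numBits, int numHashes) {
        this.bits = new BitSet(numBits);
        this.numBits = numBits;
        this.numHashes = numHashes;
    }

    // Seeded 32-bit FNV-1a hash over the UTF-8 bytes of the word.
    private static int hash(byte[] data, int seed) {
        int h = 0x811C9DC5 ^ seed;
        for (byte b : data) {
            h ^= (b & 0xFF);
            h *= 0x01000193;
        }
        return h;
    }

    // Double hashing: derive the i-th bit position from two base hashes.
    private int index(int h1, int h2, int i) {
        return Math.floorMod(h1 + i * h2, numBits);
    }

    // Sets numHashes bits for the word.
    void add(String word) {
        byte[] data = word.getBytes(StandardCharsets.UTF_8);
        int h1 = hash(data, 0);
        int h2 = hash(data, 0x9E3779B9) | 1; // force odd so the step is never zero
        for (int i = 0; i < numHashes; i++) {
            bits.set(index(h1, h2, i));
        }
    }

    // Returns false only if the word was definitely never added;
    // true means "probably present" (false positives are possible).
    boolean mightContain(String word) {
        byte[] data = word.getBytes(StandardCharsets.UTF_8);
        int h1 = hash(data, 0);
        int h2 = hash(data, 0x9E3779B9) | 1;
        for (int i = 0; i < numHashes; i++) {
            if (!bits.get(index(h1, h2, i))) {
                return false;
            }
        }
        return true;
    }
}
```

Memory use is fixed by numBits regardless of word length, so the trade-off is between filter size and false-positive rate. Unlike an FSA or a Hashtable, the words themselves cannot be enumerated or recovered from the filter.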
