On 10/07/2013 11:00 PM, Michael Schmitz wrote:
Do you know how many sentences/tokens were annotated for the OpenNLP
POS and CHUNK models?  Do you have an idea of the "sweet spot" for
number of annotations vs performance?

If the model gets bigger, the computations get more complex, but as far as I know the effect of the model no longer fitting in the CPU cache is much more significant than that. I am using hash-based int features in the name finder to reduce the memory footprint.
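The general idea behind hash-based int features (often called the hashing trick) can be sketched as below; this is an illustration of the technique, not OpenNLP's actual implementation, and the bucket count and feature names are made up:

```java
public class FeatureHashing {
    // Fixed number of weight buckets: the model's memory footprint no
    // longer grows with the vocabulary, only with this constant.
    static final int NUM_BUCKETS = 1 << 20;

    // Map a feature string to an int index in [0, NUM_BUCKETS) instead
    // of storing the string itself in the model.
    static int bucket(String feature) {
        int h = feature.hashCode() % NUM_BUCKETS;
        return h < 0 ? h + NUM_BUCKETS : h; // hashCode() may be negative
    }

    public static void main(String[] args) {
        // The same feature string always maps to the same bucket.
        System.out.println(bucket("w=John"));
        System.out.println(bucket("w=John") == bucket("w=John"));
    }
}
```

The trade-off is that different features can collide in one bucket, which is usually an acceptable price for the smaller, cache-friendlier weight array.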

I don't have much experience with the Chunker or POS Tagger with regard to performance, but it should be easy to run a series of tests; the command line tools have built-in performance monitoring.
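For such a series of tests, the throughput number the tools report is essentially sentences processed per second. A minimal sketch of that kind of measurement, with a dummy tag() stand-in where the real tagger would go:

```java
public class ThroughputTest {
    // Stand-in for a real POS tagger call; only here to have something
    // to time. Replace with the actual model invocation.
    static String[] tag(String[] sentence) {
        String[] tags = new String[sentence.length];
        for (int i = 0; i < sentence.length; i++) {
            tags[i] = "NN"; // dummy tag
        }
        return tags;
    }

    public static void main(String[] args) {
        String[] sentence = "OpenNLP is a machine learning toolkit .".split(" ");
        int n = 100_000;
        long start = System.nanoTime();
        for (int i = 0; i < n; i++) {
            tag(sentence);
        }
        double seconds = (System.nanoTime() - start) / 1e9;
        System.out.printf("%.0f sentences/sec%n", n / seconds);
    }
}
```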

Jörn
