Learn about the map-reduce strategy for processing big data. For example: http://wiki.apache.org/hadoop/MapReduce
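
To give you a taste of the pattern, here is a sketch of the classic
word-count job (the "hello world" of MapReduce), written against the
Hadoop 2.x mapreduce API; the class name and input/output paths are only
illustrative. Each mapper counts the words in its own split of the input,
and the reducers merge the partial counts. The same split-and-merge idea
is how you would chew through a 1 GB corpus in pieces.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Map phase: emit (word, 1) for every token in this mapper's input split.
  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {

    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE);
      }
    }
  }

  // Reduce phase: sum the partial counts emitted for each word.
  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {

    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class); // local pre-aggregation
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

You would run it with something like: hadoop jar wordcount.jar WordCount /input /output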
Regards, Mixie
On Mon, 07 Oct 2013 13:53:33 +0200, Jeffrey Zemerick <[email protected]> wrote:

Hi,

I'm new to OpenNLP (and NLP in general), and I'm trying to train the
NameFinder on a large corpus (nearly 1 GB). After a few hours it fails
with a "GC overhead limit exceeded" error. Do you have any suggestions on
how I might accomplish this? Is it possible to train the model on parts of
the input at a time? I tried increasing the memory available, but that
seemed only to delay the exception.
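
One way to keep the raw text out of the heap: OpenNLP's trainer pulls
samples through an ObjectStream, so it can read the corpus line by line
instead of loading it whole. A minimal sketch, assuming the OpenNLP 1.5.x
API ("train.txt", the iteration count, and the cutoff are illustrative):

import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.InputStreamReader;
import java.util.Collections;

import opennlp.tools.namefind.NameFinderME;
import opennlp.tools.namefind.NameSample;
import opennlp.tools.namefind.NameSampleDataStream;
import opennlp.tools.namefind.TokenNameFinderModel;
import opennlp.tools.util.ObjectStream;
import opennlp.tools.util.PlainTextByLineStream;
import opennlp.tools.util.TrainingParameters;
import opennlp.tools.util.featuregen.AdaptiveFeatureGenerator;

public class TrainNameFinder {
  public static void main(String[] args) throws Exception {
    // Stream the corpus one line at a time; train.txt is a placeholder
    // for data in the <START:person> ... <END> annotation format.
    ObjectStream<String> lines = new PlainTextByLineStream(
        new InputStreamReader(new FileInputStream("train.txt"), "UTF-8"));
    ObjectStream<NameSample> samples = new NameSampleDataStream(lines);

    TrainingParameters params = new TrainingParameters();
    params.put(TrainingParameters.ITERATIONS_PARAM, "100");
    params.put(TrainingParameters.CUTOFF_PARAM, "5");

    TokenNameFinderModel model;
    try {
      // Null generator selects the default feature generators.
      model = NameFinderME.train("en", "person", samples, params,
          (AdaptiveFeatureGenerator) null,
          Collections.<String, Object>emptyMap());
    } finally {
      samples.close();
    }

    // Persist the trained model.
    FileOutputStream out = new FileOutputStream("en-ner-person.bin");
    model.serialize(out);
    out.close();
  }
}

Note that, as far as I know, the maxent trainer still builds an index of
all training events in memory, so even with streaming input a larger heap
(java -Xmx4g, for example) may be needed.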

Thanks for any help.

Jeff


