Hi Rodrigo, No, I am not using multi-threading, it's a simple Java program, took help from openNLP documentation but it is worth mentioning over here is that as the corpus is containing 4 million records so my Java program running in eclipse was frequently giving me java heap space issue (out of memory issue) so I investigate a bit and found that process was taking around 10GB memory for building the model so i increased the memory to 10 GB using -Xmx parameter. so it worked properly but took 3 hours. Thanks-NIkhil From: Rodrigo Agerri <rage...@apache.org> To: "dev@opennlp.apache.org" <dev@opennlp.apache.org>; nikhil jain <nikhil_jain1...@yahoo.com> Cc: "us...@opennlp.apache.org" <us...@opennlp.apache.org> Sent: Wednesday, November 19, 2014 2:17 AM Subject: Re: Need to speed up the model creation process of OpenNLP Hi,
Are you using multithreading, lots of threads, RAM memory? R On Tue, Nov 18, 2014 at 5:46 PM, nikhil jain <nikhil_jain1...@yahoo.com.invalid> wrote: > Hi, > I asked below question yesterday, did anyone get a chance to look at this. > I am new in OpenNLP and really need some help. Please provide some clue or > link or example. > ThanksNIkhil > From: nikhil jain <nikhil_jain1...@yahoo.com.INVALID> > To: "us...@opennlp.apache.org" <us...@opennlp.apache.org>; Dev at Opennlp >Apache <dev@opennlp.apache.org> > Sent: Tuesday, November 18, 2014 12:02 AM > Subject: Need to speed up the model creation process of OpenNLP > > Hi, > I am using OpenNLP Token Name Finder for parsing the unstructured data. I > have created a corpus of about 4 million records. When I am creating a model > out of the training set using openNLP API's in Eclipse using default setting > (cut-off 5 and iterations 100), process is taking a good amount of time, > around 2-3 hours. > Can someone suggest me how can I reduce the time as I want to experiment with > different iterations but as the model creation process is taking so much > time, I am not able to experiment with it. This is really a time consuming > process. > Please provide some feedback. > Thanks in advance.Nikhil Jain > >