Re: Crawled data not inserting in the tables

2014-09-30 Thread Lewis John Mcgibbney
Hi, So did you get this sorted out? I am unsure if you achieved persistence of data. Thanks Lewis On Tuesday, September 30, 2014, Krishnanand, Kartik < kartik.krishnan...@bankofamerica.com> wrote: > Hi, Lewis > > > > Thank you for replying. I apologize in advance for asking what might well > be

RE: Crawled data not inserting in the tables

2014-09-30 Thread Krishnanand, Kartik
Hi, Lewis Thank you for replying. I apologize in advance for asking what might well be a stupid question. We are using the Crawler/InjectorJob/GeneratorJob/FetcherJob/ParserJob source code from the Nutch codebase without any modifications and calling the binary directly. @Lewis: I used the d

Re: Crawled data not inserting in the tables

2014-09-30 Thread Lewis John Mcgibbney
Can you also make sure that the cluster name and fully qualified address and port agree between mapping and Gora.properties Thanks On Tuesday, September 30, 2014, Renato Marroquín Mogrovejo < renatoj.marroq...@gmail.com> wrote: > Hi Kartik, > > If TTL hasn't been set or if it has been set to 0, t

Re: Crawled data not inserting in the tables

2014-09-30 Thread Renato Marroquín Mogrovejo
Hi Kartik, If TTL hasn't been set or if it has been set to 0, then Gora is not using any TTL[1] and all your data should be persisted without any problems. Maybe this has to do something with the url generating/fetching process? Could you determine during which process the data is changing? (gener

RE: Crawled data not inserting in the tables

2014-09-30 Thread Krishnanand, Kartik
Hi, Talat I am afraid that I do not understand. We have set the “ttl” value to 0, which is the default value. We don’t have any need portions of data that needs to be deleted. For now, I am using a single node cluster, for us the gc_grace_seconds=”0” default value would be a valid value. Hav