subject:"WebCrawler Connector code"

Re: WebCrawler Connector code

2020-07-07 Thread Karl Wright

Hi Ritika, You do not want to load the list of seeds on every document processing that is done for performance reasons. The connector API does not support accessing arbitrary job data in part for this reason. You should NEVER be calling JobManager methods from a connector either. You have

Re: WebCrawler Connector code

2020-07-07 Thread ritika jain

Hi Karl, Many thanks for your response.!! The problem I faced is to get Current JobID , so that's why I used the JobStatus class. another thing is to get the seeds corresponding to the running JOb ID. activities object is having value of job ID set in its constructor object. But no way to get

Re: WebCrawler Connector code

2020-07-06 Thread Karl Wright

Hi Ritika, ' My requirement is to abort a job whenever a seed-corresponding site is down or returning some 5xx response codes. ' (1) Connector methods, like addSeedDocuments(), are called by the framework. You do not call them yourself when you write a connector. So you are looking in the

WebCrawler Connector code

2020-07-06 Thread ritika jain

Hi All, I have confusion regarding WebCrawler connector code.My requirement is to abort a job whenever a seed-corresponding site is down or returning some 5xx response codes. So I have used the jobManager errorAbort method for this in addSeedDocuments method of Webcrawlerconnector.java..,