No, that is not possible. But it should be rather easy to add Jexl support for 
CrawlDB update that allows you to conditionally delete CrawlDB entries via a 
Jexl script.

There is already Jexl support in generator and CrawlDB reader.

Markus

-----Original message-----
> From:Manish Verma <m_ve...@apple.com>
> Sent: Tuesday 12th July 2016 16:09
> To: user@nutch.apache.org
> Subject: Re: Delete db_gone from crawdb
> 
> I mean, like the solrclean and -deletegone options, do we have any option to 
> delete it from crawldb? Using purge we have to change a nutch-site property, 
> and we don’t want to turn purge on all the time.
> Can we specify something at run time to delete these from crawldb (some script 
> or runtime argument)?
> 
> Regards,
> MV
> 
> > On Jul 12, 2016, at 1:48 AM, Markus Jelsma <markus.jel...@openindex.io> 
> > wrote:
> > 
> > Hi - what do you mean by control? In any case, you can turn it on once and 
> > purge db_gone, then turn it off again, right?
> > Markus
> > 
> > 
> > 
> > -----Original message-----
> >> From:Manish Verma <m_ve...@apple.com>
> >> Sent: Tuesday 12th July 2016 8:08
> >> To: user@nutch.apache.org
> >> Subject: Delete db_gone from crawdb
> >> 
> >> Hi,
> >> 
> >> We want to delete db_gone docs from crawldb without turning purge on.
> >> We want to control this so that we can delete these whenever we wish to 
> >> clean crawldb.
> >> 
> >> Regards,
> >> MV
> >> 
> >> 
> 
> 
