Hi,

how do you manage recrawling with current nutch versions (1.2 or 1.3)?
I have seen some scripts in the wiki for older versions but none for 1.2 or 
above.
Generally db.fetch.interval.default seems ok for my use case but I have 
situations where I need to
force a recrawl. How would you do this? bin/nutch crawl seems not to have a 
option for this.
Optionally it would be perfect to only force a recrawl for urls matching a 
defined regex.

Any ideas?

Reply via email to