Hi,
in the fetcher, at line 192, when the status is NOTMODIFIED we collect
null as the content, even though we already have the content.
I'm worried about what happens to a page that does not change for 60
days, since the concept of Nutch is to delete segments that are older
than "db.default.fetch.interval",
Stefan Groschupf wrote:
Hi,
in the fetcher, at line 192, when the status is NOTMODIFIED we collect
null as the content, even though we already have the content.
I'm worried about what happens to a page that does not change for 60 days,
since the concept of Nutch is to delete segments that are older than
"db.de