nutch is losing not modified pages

2006-05-08 Thread Stefan Groschupf
Hi, in the fetcher (line 192), when the status is NOTMODIFIED we collect null as content, although we already have the content. I'm worried about what happens to a page that does not change for 60 days, since the concept of Nutch is to delete segments that are older than "db.default.fetch.interval",
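
The concern above can be illustrated with a small toy model. This is only a sketch, not Nutch code: `Segment`, `fetch`, `prune`, `lookup`, and `simulate` are hypothetical names, and the 30-day value standing in for "db.default.fetch.interval" is an assumption. It shows how a page that keeps returning NOTMODIFIED could end up with no stored content once the segment holding the original content is pruned.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Assumed stand-in for "db.default.fetch.interval" (in days); not read from Nutch.
FETCH_INTERVAL_DAYS = 30


@dataclass
class Segment:
    """Hypothetical segment: the day it was created and the content it holds."""
    created_day: int
    content: Dict[str, Optional[str]] = field(default_factory=dict)


def fetch(url: str, day: int, modified: bool, body: Optional[str] = None) -> Segment:
    """Create a new segment; a NOTMODIFIED result stores None as content."""
    return Segment(created_day=day, content={url: body if modified else None})


def prune(segments: List[Segment], today: int) -> List[Segment]:
    """Drop segments older than the fetch interval."""
    return [s for s in segments if today - s.created_day <= FETCH_INTERVAL_DAYS]


def lookup(segments: List[Segment], url: str) -> Optional[str]:
    """Return content from the newest segment that holds non-null content."""
    for seg in sorted(segments, key=lambda s: s.created_day, reverse=True):
        if seg.content.get(url) is not None:
            return seg.content[url]
    return None


def simulate():
    url = "http://example.com/"
    # Day 0: first fetch, content is stored.
    segments = [fetch(url, day=0, modified=True, body="hello")]
    # Day 30: refetch, server reports not modified, so null content is stored.
    segments.append(fetch(url, day=30, modified=False))
    before = lookup(prune(segments, today=30), url)
    # Day 60: still not modified; the day-0 segment is now past the interval.
    segments.append(fetch(url, day=60, modified=False))
    after = lookup(prune(segments, today=60), url)
    return before, after
```

In this model `simulate()` still finds the content at day 30, but finds nothing at day 60, because the only segment with real content has been pruned. Whether Nutch behaves this way depends on how the fetcher and segment cleanup actually interact, which is the question raised in this thread.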

Re: nutch is losing not modified pages

2006-05-08 Thread Andrzej Bialecki
Stefan Groschupf wrote: Hi, in the fetcher (line 192), when the status is NOTMODIFIED we collect null as content, although we already have the content. I'm worried about what happens to a page that does not change for 60 days, since the concept of Nutch is to delete segments that are older than "db.de