Hello,

 

This problem happens at the second time I crawl a page

 

bin/nutch inject urls/

bin/nutch generate -topN 1000

bin/nutch fetch -all

bin/nutch parse -force -all

bin/nutch updatedb -all

 

second time :

 

bin/nutch generate -topN 1000 --> batchid changes for all existing pages

bin/nutch fetch -all --> *** metadatas are delete for all pages already
crawled **

bin/nutch parse -force -all

bin/nutch updatedb -all

 

I'm using mongodb

 

Any Help please ? I'm not sure if it's a nutch bug or  it's my
misunderstanding on nutch.

 

Best regards,


Adnane

 

 

 

Reply via email to