Thank you Markus. -----Original Message----- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: February 23, 2016 5:56 AM To: user@nutch.apache.org Subject: RE: fetch deletes all metadata except _csh_ and _rs_
Hello Adnane - your mails are received on the mailing list. There is probably no one that has read your mail and can respond to it. Markus -----Original message----- > From:Adnane Benjelloun <adn...@mediaplusplus.com> > Sent: Tuesday 23rd February 2016 1:25 > To: user@nutch.apache.org > Subject: Re: fetch deletes all metadata except _csh_ and _rs_ > > Hi, > > Can you please confirm if you receives my emails ? > > > On Feb 22, 2016, at 9:23 AM, Adnane Benjelloun <adn...@mediaplusplus.com> > > wrote: > > > > Hi everybody, > > > > No one has tried to help me. Any suggestion please ? > > > > Is there another place where I can ask my question if I'm not in the > > right list ?Best regards, > > > > Adnane > > > > --------------------------------------------------------- > > > > From: Adnane Benjelloun [mailto:adn...@mediaplusplus.com] > > Sent: February 16, 2016 10:04 PM > > To: user@nutch.apache.org > > Subject: fetch deletes all metadata except _csh_ and _rs_ > > > > Hello, > > > > This problem happens at the second time I crawl a page > > > > bin/nutch inject urls/ > > bin/nutch generate -topN 1000 > > bin/nutch fetch -all > > bin/nutch parse -force -all > > bin/nutch updatedb –all > > > > second time : > > > > bin/nutch generate -topN 1000 --> batchid changes for all existing > > pages bin/nutch fetch -all --> *** metadatas are delete for all > > pages already crawled ** bin/nutch parse -force -all bin/nutch > > updatedb –all > > > > I'm using mongodb > > > > Any Help please ? I’m not sure if it’s a nutch bug or it’s my > > misunderstanding on nutch. > > > > Best regards, > > > > Adnane > > > > > > > > >