Hi, NutchIndexAction is indeed prepared to handle updates but the methods are not implemented. In case of Solr, it still does an internal add/delete for updated documents, and to do so, you must have all fields stored="true". So in almost all cases, it is more efficient not to store all fields and send some additional data over the wire. You can implement it though.
Markus -----Original message----- > From:Ali Nazemian <alinazem...@gmail.com> > Sent: Tuesday 1st July 2014 15:31 > To: user@nutch.apache.org > Subject: Changing nutch for update documents instead of add new ones > > Dears, > Hi, > I am going to do some changes in nutch default behavior. I want to change > nutch solr index (indexWriter class) in a way that instead of adding new > document to solr, old documents are updated. I saw an "update" method > inside this class. Is that implemented for this purpose? If no what is the > purpose of this method? Another question is doing such thing (changing > indexWriter to update document instead of adding them) would affect my > performance for whole web crawling? > Best regards. > > -- > A.Nazemian >