Hi Lewis,

I've created patch NUTCH-1294-v3.patch. 
Here are the steps I followed:

$ svn checkout http://svn.apache.org/repos/asf/nutch/tags/release-2.2.1
$ cd release-2.2.1
$ patch -p0 < NUTCH-1294-v2.patch
# manually patched "src/bin/nutch" and "conf/log4j.properties"
$ ant
$ svn diff > NUTCH-1294-v3.patch
# attached the new patch up on jira

With this patch, all the files in the patch are deployed successfully. In
the previous patch (v2), "src/bin/nutch" and "conf/log4j.properties" had to
be patched manually.

As I said, the task is working fine, i.e. documents with status = 3 are
removed from Solr.
The only caveat is that you need to set storage.crawl.id in nutch-site.xml
if the crawling was done with a crawl_id, otherwise the solr clean task will
not do anything.

Thanks,
Claudiu.






--
View this message in context: 
http://lucene.472066.n3.nabble.com/SolrClean-not-available-in-nutch-2-x-tp4081385p4081790.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to