Hi Lewis, I disturb you for I thought you may help me: I'm seeking a
Nutch/Solr expert for a new project. I wish he/she be in Europe,
preferably Italy or Switzerland.
If you know somebody who could fit in please leave them my e.mail address.
Thank You
Andrea Lanzoni
Il 12/04/2013 23:36, Lewis John McGibbney (JIRA) ha scritto:
[
https://issues.apache.org/jira/browse/NUTCH-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1556:
----------------------------------------
Fix Version/s: 2.2
enabling updatedb to accept batchId
------------------------------------
Key: NUTCH-1556
URL: https://issues.apache.org/jira/browse/NUTCH-1556
Project: Nutch
Issue Type: Improvement
Affects Versions: 2.2
Reporter: kaveh minooie
Fix For: 2.2
Attachments: NUTCH-1556.patch
So the idea here is to be able to run updatedb and fetch for different batchId
simultaneously. I put together a patch. it seems to be working ( it does skip
the rows that do not match the batchId), but I am worried if and how it might
affect the sorting in the reduce part. anyway check it out.
it also change the command line usage to this:
Usage: DbUpdaterJob (<batchId> | -all) [-crawlId <id>]
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira