[ http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361311 ]
Andrzej Bialecki commented on NUTCH-61:
I'm working on this; the patch will be available in a couple of days. I could
then use your help with review and testing... ;-)
[ http://issues.apache.org/jira/browse/NUTCH-95?page=comments#action_12361310 ]
Andrzej Bialecki commented on NUTCH-95:
Yes, you are right - we first need to order the segments by date, so that we
always keep the latest revisions of a page (possibl
[ http://issues.apache.org/jira/browse/NUTCH-61?page=comments#action_12361302 ]
byron miller commented on NUTCH-61:
---
Is there a patch modified for the current branch, or should I take a stab at
this?
> Adaptive re-fetch interval. Detecting unmodified conte
[ http://issues.apache.org/jira/browse/NUTCH-55?page=comments#action_12361301 ]
byron miller commented on NUTCH-55:
---
You can close this ticket; it is a duplicate of NUTCH-59.
> Create dmoz.org search plugin - incorporate the dmoz.org
> title/category/descr
[ http://issues.apache.org/jira/browse/NUTCH-95?page=comments#action_12361300 ]
byron miller commented on NUTCH-95:
---
Number 2 sounds great, but wouldn't you always want the latest scoring document
since that should reflect the latest updatedb and rank of
Hi,
It looks like JIRA is down.
We already discussed that an NPE is thrown when the remote method
does not return any values.
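
To make the failure mode concrete, here is a minimal, self-contained sketch of a guard against a remote method that returns no value. It is only an illustration under assumptions, not the Nutch IPC code: `RemoteMethod` and `collect` are hypothetical names, and plain strings stand in for `Writable` values.

```java
import java.util.ArrayList;
import java.util.List;

public class NullSafeCall {
    // Hypothetical stand-in for a remote method; returns null when it has no value.
    interface RemoteMethod {
        String invoke(String param);
    }

    // Invokes the method once per parameter and keeps only non-null results,
    // so callers never dereference a missing value.
    static List<String> collect(RemoteMethod method, String[] params) {
        List<String> results = new ArrayList<>();
        if (params == null || params.length == 0) {
            return results; // nothing to call, return an empty list instead of null
        }
        for (String param : params) {
            String value = method.invoke(param);
            if (value != null) { // skip calls that produced no value
                results.add(value);
            }
        }
        return results;
    }

    public static void main(String[] args) {
        // Assumed behaviour for the demo: an empty parameter yields no value.
        RemoteMethod method = p -> p.isEmpty() ? null : p.toUpperCase();
        System.out.println(collect(method, new String[] {"a", "", "b"}));
    }
}
```

Whether a missing value should be skipped, as here, or reported to the caller is a design choice that the thread does not settle.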
I also note this line in client.java
public Writable[] call(Writable[] params, InetSocketAddress[] addresses)
throws IOException {
if (params.length == 0) ret