is 'crawl.gen.delay' still being used anywhere? cause I can't find
anything in the source code except for here:
package org.apache.nutch.crawl;
public class GeneratorJob extends NutchTool implements Tool {
public static final String GENERATOR_TOP_N = "generate.topN";
public static final St
[
https://issues.apache.org/jira/browse/NUTCH-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13741154#comment-13741154
]
Lewis John McGibbney commented on NUTCH-1598:
-
great work Markus. This is dyna
I started reading the Nutch docs and noticed a little typo.
>From http://nutch.apache.org/#What+is+Apache+Nutch%3F
"Being pluggable and modular of course has it's benefits"
should be
"Being pluggable and modular of course has its benefits"
It's my bad,
I discovered that manually entered keys had extra http at the end, so
Nutch wasn't able to see those as host keys and skipping them.
On 08/14/2013 11:23 PM, Ahmet Emre Aladağ wrote:
Hi,
I added additional mtdt:_hr_ records in HBase holding scores externally.
To get the score s
4 matches
Mail list logo