[
https://issues.apache.org/jira/browse/NUTCH-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Roland updated NUTCH-1534:
--
Environment:
nutch 2.1 / cassandra 1.2.1 / gora-cassandra 0.2 / gora-core 0.2.1
running fetch with parse=true
f
[
https://issues.apache.org/jira/browse/NUTCH-1534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13585688#comment-13585688
]
Roland commented on NUTCH-1534:
---
Because of the ConcurrentModificationException I changed my
@Tejas +1
I think:
Keep Property
-------------
- generate.max.count: keep it because it is still used by GeneratorJob and the Reducer.
- GENERATOR_MAX_COUNT
Deprecate Property
------------------
- GENERATOR_MIN_SCORE
- GENERATOR_COUNT_VALUE_IP
Add in nutch-default.xml
------------------------
Hi Lewis,
We have not come to a conclusion on this topic.
Here is what I propose:
1. keep "generate.max.count"
2. GENERATOR_MIN_SCORE and GENERATOR_MAX_COUNT: once we find out whether
they were kept in 2.x for some valid reason, we can safely remove
these params. These do not seem to
[
https://issues.apache.org/jira/browse/NUTCH-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13585467#comment-13585467
]
Tejas Patil commented on NUTCH-1031:
Hi Sebastian,
Thanks for your time and suggesting